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Series Preface 


Mathematics is playing an ever more important role in the physical and 
biological sciences, provoking a blurring of boundaries between scientific 
disciplines and a resurgence of interest in the modern as well as the classical 
techniques of applied mathematics. This renewal of interest, both in research 
and teaching, has led to the establishment of the series: Texts in Applied 
Mathematics (TAM ). 

The development of new courses is a natural consequence of a high level 
of excitement on the research frontier as newer techniques, such as numerical 
and symbolic computer systems, dynamical systems, and chaos, mix with and 
reinforce the traditional methods of applied mathematics. Thus, the purpose 
of this textbook series is to meet the current and future needs of these 
advances and encourage the teaching of new courses. 

TAM will publish textbooks suitable for use in advanced undergraduate 
and beginning graduate courses, and will complement the Applied Mathe- 
matical Sciences (AMS ) series, which will focus on advanced textbooks and 
research level monographs. 


Preface to the Fourth Edition 


There are two major changes in the Fourth Edition of Differential Equations 
and Their Applications. The first concerns the computer programs in this text. 
In keeping with recent trends in computer science, we have replaced all the 
APL programs with Pascal and C programs. The Pascal programs appear in 
the text in place of the APL programs, where they are followed by the Fortran 
programs, while the C programs appear in Appendix C. 

The second change, in response to many readers’ suggestions, is the in- 
clusion of a new chapter (Chapter 6) on Sturm-Liouville boundary value 
problems. Our goal in this chapter is not to present a whole lot of technical 
material. Rather it is to show that the theory of Fourier series presented in 
Chapter 5 is not an isolated theory but is part of a much more general and 
beautiful theory which encompasses many of the key ideas of linear algebra. 

To accomplish this goal we have included some additional material from 
linear algebra. In particular, we have introduced the notions of inner product 
spaces and self-adjoint matrices, proven that the eigenvalues of a self-adjoint 
matrix are real, and shown that all self-adjoint matrices possess an ortho- 
normal basis of eigenvectors. These results are at the heart of Sturm-Liouville 
theory. 

I wish to thank Robert Giresi for writing the Pascal and C programs. 


New York City Martin Braun 
May, 1992 


Preface to the Third Edition 


There are three major changes in the Third Edition of Differential Equations 
and Their Applications. First, we have completely rewritten the section on 
singular solutions of differential equations. A new section, 2.8.1, dealing 
with Euler equations has been added, and this section is used to motivate a 
greatly expanded treatment of singular equations in sections 2.8.2 and 2.8.3. 

Our second major change is the addition of a new section, 4.9, dealing 
with bifurcation theory, a subject of much current interest. We felt it 
desirable to give the reader a brief but nontrivial introduction to this 
important topic. 

Our third major change is in Section 2.6, where we have switched to the 
metric system of units. This change was requested by many of our readers. 

In addition to the above changes, we have updated the material on 
population models, and have revised the exercises in this section. Minor 
editorial changes have also been made throughout the text. 


New York City 
November, 1982 Martin Braun 


Preface to the First Edition 


This textbook is a unique blend of the theory of differential equations and 
their exciting application to “real world” problems. First, and foremost, it 
is a rigorous study of ordinary differential equations and can be fully 
understood by anyone who has completed one year of calculus. However, 
in addition to the traditional applications, it also contains many exciting 
“real life” problems. These applications are completely self contained. 
First, the problem to be solved is outlined clearly, and one or more 
differential equations are derived as a model for this problem. These 
equations are then solved, and the results are compared with real world 
data. The following applications are covered in this text. 

1. In Section 1.3 we prove that the beautiful painting “Disciples of 
Emmaus” which was bought by the Rembrandt Society of Belgium for 
$170,000 was a modern forgery. 

2. In Section 1.5 we derive differential equations which govern the 
population growth of various species, and compare the results predicted by 
our models with the known values of the populations. 

3. In Section 1.6 we derive differential equations which govern the rate at 
which farmers adopt new innovations. Surprisingly, these same differential 
equations govern the rate at which technological innovations are adopted in 
such diverse industries as coal, iron and steel, brewing, and railroads. 

4. In Section 1.7 we try to determine whether tightly sealed drums filled 
with concentrated waste material will crack upon impact with the ocean 
floor. In this section we also describe several tricks for obtaining informa- 
tion about solutions of a differential equation that cannot be solved 
explicitly. 
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5. In Section 2.7 we derive a very simple model of the blood glucose 
regulatory system and obtain a fairly reliable criterion for the diagnosis of 
diabetes. 

6. Section 4.5 describes two applications of differential equations to 
arms races and actual combat. In Section 4.5.1 we discuss L. F. Richard- 
son’s theory of the escalation of arms races and fit his model to the arms 
race which led eventually to World War I. This section also provides the 
reader with a concrete feeling for the concept of stability. In Section 4.5.2 
we derive two Lanchestrian combat models, and fit one of these models, 
with astonishing accuracy, to the battle of Iwo Jima in World War II. 

7. In Section 4.10 we show why the predator portion (sharks, skates, rays, 
etc.) of all fish caught in the port of Fiume, Italy rose dramatically during 
the years of World War I. The theory we develop here also has a 
spectacular application to the spraying of insecticides. 

8. In Section 4.11 we derive the “principle of competitive exclusion,” 
which states, essentially, that no two species can earn their living in an 
identical manner. 

9. In Section 4.12 we study a system of differential equations which 
govern the spread of epidemics in a population. This model enables us to 
prove the famous “threshold theorem of epidemiology,” which states that 
an epidemic will occur only if the number of people susceptible to the 
disease exceeds a certain threshold value. We also compare the predictions 
of our model with data from an actual plague in Bombay. 

10. In Section 4.13 we derive a model for the spread of gonorrhea and 
prove that either this disease dies out, or else the number of people who 
have gonorrhea will ultimately approach a fixed value. 

This textbook also contains the following important, and often unique 
features. 

1. In Section 1.10 we give a complete proof of the existence—uniqueness 
theorem for solutions of first-order equations. Our proof is based on the 
method of Picard iterates, and can be fully understood by anyone who has 
completed one year of calculus. 

2. In Section 1.11 we show how to solve equations by iteration. This 
section has the added advantage of reinforcing the reader’s understanding 
of the proof of the existence—uniqueness theorem. 

3. Complete Fortran and APL programs are given for every computer 
example in the text. Computer problems appear in Sections 1.13-1.17, 
which deal with numerical approximations of solutions of differential 
equations; in Section 1.11, which deals with solving the equations x = f(x) 
and g(x)=0; and in Section 2.8, where we show how to obtain a power- 
series solution of a differential equation even though we cannot explicitly 
solve the recurrence formula for the coefficients. 

4. A self-contained introduction to the computing language APL is 
presented in Appendix C. Using this appendix we have been able to teach 
our students APL in just two lectures. 
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5. Modesty aside, Section 2.12 contains an absolutely super and unique 
treatment of the Dirac delta function. We are very proud of this section 
because it eliminates all the ambiguities which are inherent in the tradi- 
tional exposition of this topic. 

6. All the linear algebra pertinent to the study of systems of equations 1s 
presented in Sections 3.1-3.7. One advantage of our approach is that the 
reader gets a concrete feeling for the very important but extremely abstract 
properties of linear independence, spanning, and dimension. Indeed, many 
linear algebra students sit in on our course to find out what’s really going 
on in their course. 

Differential Equations and Their Applications can be used for a one- or 
two-semester course in ordinary differential equations. It is geared to the 
student who has completed two semesters of calculus. Traditionally, most 
authors present a “suggested syllabus” for their textbook. We will not do so 
here, though, since there are already more than twenty different syllabi in 
use. Suffice it to say that this text can be used for a wide variety of courses 
in ordinary differential equations. 

I greatly appreciate the help of the following people in the preparation 
of this manuscript: Douglas Reber who wrote the Fortran programs, 
Eleanor Addison who drew the original figures, and Kate MacDougall, 
Sandra Spinacci, and Miriam Green who typed portions of this manu- 
script. 

I am grateful to Walter Kaufmann-Biuhler, the mathematics editor at 
Springer-Verlag, and Elizabeth Kaplan, the production editor, for their 
extensive assistance and courtesy during the preparation of this 
manuscript. It is a pleasure to work with these true professionals. 

Finally, I am especially grateful to Joseph P. LaSalle for the encourage- 
ment and help he gave me. Thanks again, Joe. 


New York City ; 
July, 1976 Martin Braun 


Chapter 1 
First-order differential equations 


Introduction 

First-order linear differential equations 

The Van Meegeren art forgeries 

Separable equations 

Population models 

The spread of technological innovations 

An atomic waste disposal problem 

The dynamics of tumor growth, mixing problems, and 
orthogonal trajectories 

Exact equations, and why we cannot solve very many 
differential equations 


0 The existence—uniqueness theorem; Picard iteration 
1 Finding roots of equations by iteration 


1.12 


1.13 


1.14 
1.15 
1.16 
1.17 


1.11.1 Newton’s method 

Difference equations, and how to compute the interest 
due on your student loans 

Numerical approximations; Euler’s method 

1.13.1 Error analysis for Euler’s method 

The three term Taylor series method 

An improved Euler method 

The Runge-Kutta method 

What to do in practice 


Contents 


Contents 


Chapter 2 
Second-order linear differential equations 


2.1 Algebraic properties of solutions 
2.2 Linear equations with constant coefficients 
2.2.1 Complex roots 
2.2.2 Equal roots; reduction of order 
2.3 The nonhomogeneous equation 
2.4 The method of variation of parameters 
2.5 The method of judicious guessing 
2.6 Mechanical vibrations 
2.6.1 The Tacoma Bridge disaster 
2.6.2 Electrical networks 
2.7 A model for the detection of diabetes 
2.8 Series solutions 
2.8.1 Singular points, Euler equations 
2.8.2 Regular singular points, the method of Frobenius 
2.8.3 Equal roots, and roots differing by an integer 
2.9 The method of Laplace transforms 
2.10 Some useful properties of Laplace transforms 


2.11 Differential equations with discontinuous nght-hand sides 


2.12 The Dirac delta function 

2.13 The convolution integral 

2.14 The method of elimination for systems 
2.15 Higher-order equations 


Chapter 3 
Systems of differential equations 


3.1 Algebraic properties of solutions of linear systems 

3.2 Vector spaces 

3.3 Dimension of a vector space 

3.4 Applications of linear algebra to differential equations 
3.5 The theory of determinants 

3.6 Solutions of simultaneous linear equations 

3.7 Linear transformations 

3.8 The eigenvalue—eigenvector method of finding solutions 
3.9 Complex roots 

3.10 Equal roots 

3.11 Fundamental matrix solutions; e4 

3.12 The nonhomogeneous equation; variation of parameters 
3.13 Solving systems by Laplace transforms 


Chapter 4 
Qualitative theory of differential equations 


4.1 Introduction 
4.2 Stability of linear systems 


127 


127 
138 
141 
145 
151 
153 
157 
165 
173 
175 
178 
185 
198 
203 
219 
225 
233 
238 
243 
251 
257 
259 


264 


264 
273 
279 
291 
297 
310 
320 
333 
341 
345 
355 
360 
368 


372 


372 
378 


4.3 Stability of equilibrium solutions 
4.4 The phase-plane 
4.5 Mathematical theories of war 
4.5.1 L.F. Richardson’s theory of conflict 
4.5.2 Lanchester’s combat models and the battle of Iwo Jima 
4.6 Qualitative properties of orbits 
4.7 Phase portraits of linear systems 
4.8 Long time behavior of solutions; the Poincaré-Bendixson Theorem 
4.9 Introduction to bifurcation theory 
4.10 Predator-prey problems; or why 
the percentage of sharks caught in the Mediterranean 
Sea rose dramatically during World War I 
4.11 The principle of competitive exclusion in population biology 
4.12 The Threshold Theorem of epidemiology 
4.13 A model for the spread of gonorrhea 


Chapter 5 
Separation of variables and Fourier series 


5.1 Two point boundary-value problems 

5.2 Introduction to partial differential equations 
5.3. The heat equation; separation of variables 
5.4 Fourier series 

5.5 Even and odd functions 

5.6 Return to the heat equation 

5.7 The wave equation 

5.8 Laplace’s equation 


Chapter 6 
Sturm—Liouville boundary value problems 


6.1 Introduction 

6.2 Inner product spaces 

6.3 Orthogonal bases, Hermitian operators 
6.4 Sturm—Liouville theory 


Appendix A 


Some simple facts concerning functions 
of several variables 


Appendix B 
Sequences and series 


Appendix C 
C Programs 


Contents 


385 
394 
398 
398 
405 
414 
418 
428 
437 


443 
451 
458 
465 


476 


476 
481 
483 
487 
493 
498 
503 
508 


514 


514 
515 
526 
533 


545 


547 


549 


Contents 


Answers to odd-numbered exercises 557 


Index 575 


First-order differential equations 


1.1 Introduction 


This book is a study of differential equations and their applications. A dif- 
ferential equation is a relationship between a function of time and its de- 
rivatives. The equations 


dy 2. . 
a = 3y*sin(t+y) (1) 
and 
dy d*y 
— =e *+t+ — il 
dt? dt? m) 


are both examples of differential equations. The order of a differential 
equation is the order of the highest derivative of the function y that ap- 
pears in the equation. Thus (1) is a first-order differential equation and (11) 
is a third-order differential equation. By a solution of a differential equa- 
tion we will mean a continuous function y(t) which together with its de- 
rivatives satisfies the relationship. For example, the function 


y(t)=2sint—4cos2t 
is a solution of the second-order differential equation 


ay + 2t 
— +y=cos 
dt? 

since 

d? : 1 : l 

7a (2sint— 3 COS 2t) + (2sint— 5 cos2r) 


=(—2sint+ $cos2r)+2sint— 5 cos2t=cos 21. 
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Differential equations appear naturally in many areas of science and the 
humanities. In this book, we will present serious discussions of the applica- 
tions of differential equations to such diverse and fascinating problems as 
the detection of art forgeries, the diagnosis of diabetes, the increase in the 
percentage of sharks present in the Mediterranean Sea during World War 
I, and the spread of gonorrhea. Our purpose is to show how researchers 
have used differential equations to solve, or try to solve, real life problems. 
And while we will discuss some of the great success stories of differential 
equations, we will also point out their limitations and document some of 
their failures. 


1.2 First-order linear differential equations 


We begin by studying first-order differential equations and we will assume 
that our equation is, or can be put, in the form 


d 
<= f(t). (1) 


The problem before us is this: Given f(t,y) find all functions y(t) which 
satisfy the differential equation (1). We approach this problem in the 
following manner. A fundamental principle of mathematics is that the way 
to solve a new problem is to reduce it, in some manner, to a problem that 
we have already solved. In practice this usually entails successively sim- 
plifying the problem until it resembles one we have already solved. Since 
we are presently in the business of solving differential equations, it is advis- 
able for us to take inventory and list all the differential equations we can 
solve. If we assume that our mathematical background consists of just ele- 
mentary calculus then the very sad fact is that the only first-order differen- 
tial equation we can solve at present is 


d 
— = (2) (2) 


where g is any integrable function of time. To solve Equation (2) simply 
integrate both sides with respect to ¢, which yields 


v(t)= f g(ndtte. 


Here c is an arbitrary constant of integration, and by | g(t)dt we mean an 


anti-derivative of g, that is, a function whose derivative is g. Thus, to solve 
any other differential equation we must somehow reduce it to the form (2). 
As we will see in Section 1.9, this is impossible to do in most cases. Hence, 
we will not be able, without the aid of a computer, to solve most differen- 
tial equations. It stands to reason, therefore, that to find those differential 
equations that we can solve, we should start with very simple equations 
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and not ones like 


ad _ esin(¢—37V|y| ) 


dt 
(which incidentally, cannot be solved exactly). Experience has taught us 
that the “simplest” equations are those which are linear in the dependent 
variable y. 


Definition. The general first-order linear differential equation is 


o +a(t)y=6(0). (3) 
Unless otherwise stated, the functions a(t) and b(t) are assumed to be 
continuous functions of time. We single out this equation and call it lin- 
ear because the dependent variable y appears by itself, that is, no terms 
such as e~”, y? or siny etc. appear in the equation. For example dy / dt 
=y?+sin¢ and dy /dt=cosy + ¢ are both nonlinear equations because of 
the y* and cosy terms respectively. 


Now it is not immediately apparent how to solve Equation (3). Thus, we 
simplify it even further by setting b(t) =0. 


Definition. The equation 
dy 
= +a(t)y=0 (4) 


is called the homogeneous first-order linear differential equation, and 
Equation (3) 1s called the nonhomogeneous first-order linear differential 
equation for b(t) not identically zero. 


Fortunately, the homogeneous equation (4) can be solved quite easily. 
First, divide both sides of the equation by y and rewrite it in the form 
dy 
dt 
—=-—a(t). 
Second, observe that 
dy 


dt qd 
yy =a nO 


where by In| y(¢)| we mean the natural logarithm of | y(¢)|. Hence Equation 
(4) can be written in the form 


“In| y(1)|=—a(1). (5) 
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But this is Equation (2) “essentially” since we can integrate both sides of 
(5) to obtain that 


In| y(2)| = — fa(s)dt+e, 


where c, iS an arbitrary constant of integration. Taking exponentials of 
both sides yields 


I» (Ol=exp( — fa(s)ar+c,)=cexp(— fatsar] 
or 


»exo( fata) =¢, (6) 


Now, y(t) exp i a(t) at) is acontinuous function of time and Equation (6) 


States that its absolute value is constant. But if the absolute value of a con- 
tinuous function g(t) is constant then g itself must be constant. To prove 
this observe that if g is not constant, then there exist two different times ¢, 
and f, for which g(t,)=c and g(t,)=—c. By the intermediate value theo- 
rem of calculus g must achieve all values between —c and +c which is im- 


possible if | g(t)|=c. Hence, we obtain the equation y(t) exp f a(t) at) =¢C 
or 


y(t)=cexp(— fatsdr), (7) 


Equation (7) 1s said to be the general solution of the homogeneous equa- 
tion since every solution of (4) must be of this form. Observe that an arbi- 
trary constant c appears in (7). This should not be too surprising. Indeed, 
we will always expect an arbitrary constant to appear in the general solu- 
tion of any first-order differential equation. To wit, if we are given dy /dt 
and we want to recover y(t), then we must perform an integration, and 
this, of necessity, yields an arbitrary constant. Observe also that Equation 
(4) has infinitely many solutions; for each value of c we obtain a distinct 
solution y(f). 


Example 1. Find the general solution of the equation (dy /dt)+2ty =0. 
Solution. Here a(t)=2t¢ so that y(t)= cexp| _ [tae =ce-". 


Example 2. Determine the behavior, as t->oo, of all solutions of the equa- 
tion (dy /dt)+ ay =0, a constant. 


Solution. The general solution is y(t)=c exp| — f a at} =ce~™, Hence if 


a<0, all solutions, with the exception of y =0, approach infinity, and if a 
> 0, all solutions approach zero as t->0©o. 
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In applications, we are usually not interested in all solutions of (4). 
Rather, we are looking for the specific solution y(t) which at some initial 
time f) has the value y). Thus, we want to determine a function y(t) such 
that 


d 
S+a(t)y=0,  y(to)=Ye (8) 


Equation (8) is referred to as an initial-value problem for the obvious rea- 
son that of the totality of all solutions of the differential equation, we are 
looking for the one solution which initially (at time ¢,) has the value yo. To 
find this solution we integrate both sides of (5) between ¢) and t. Thus 


[ Gavola=— fase 
and, therefore 
y(t) 
¥ (to) 
Taking exponentials of both sides of this equation we obtain that 


y(t) 
oh) = exp(~ f'a(s)a 


In| y(t)|—In| y (to)|=In 


=— fas) ds. 


t 


Or 


The function inside the absolute value sign is a continuous function of 
time. Thus, by the argument given previously, it is either identically +1 or 
identically —1. To determine which one it is, evaluate it at the point 1; 
since 


y(t) to _ 

y(t) exp( f a(s)ds} = | 
we see that 

y(t) 

yp) exp als) =| 
Hence 


(1) =v (0) e(— f a(3)ds}=ygexr(— f'a(s)as} 
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Example 3. Find the solution of the initial-value problem 
d 
—+(sint)y=0,  y(0)=3. 


Solution. Here a(t)=sint¢ so that 


t 
y(t)=3 exo| — i sins ds] = Zeleos— 1, 


Example 4. Find the solution of the initial-value problem 


dy a — 
at? y =0, y(1)=2. 


Solution. Here a(t)=e" so that 
y(a)=2exp{ - f'e*as). 
1 


Now, at first glance this problem would seem to present a very serious dif- 
ficulty in that we cannot integrate the function e* directly. However, this 
solution is equally as valid and equally as useful as the solution to Example 
3. The reason for this is twofold. First, there are very simple numerical 
schemes to evaluate the above integral to any degree of accuracy with the 
aid of a computer. Second, even though the solution to Example 3 is given 
explicitly, we still cannot evaluate it at any time ¢ without the aid of a table 
of trigonometric functions and some sort of calculating aid, such as a slide 
rule, electronic calculator or digital computer. 


We return now to the nonhomogeneous equation 
dy 
—+a(t)y=b(t). 
— +a(t)y=B(1) 

It should be clear from our analysis of the homogeneous equation that the 

way to solve the nonhomogeneous equation is to express it in the form 


a (“something”) = b(1) 


and then to integrate both sides to solve for “something”. However, the ex- 
pression (dy /dt)+ a(t)y does not appear to be the derivative of some sim- 
ple expression. The next logical step in our analysis therefore should be the 
following: Can we make the left hand side of the equation to be d/dt of 
“something”? More precisely, we can multiply both sides of (3) by any 
continuous function p(t) to obtain the equivalent equation 


we) +a(0) wl) y= (H0(0). (9) 
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(By equivalent equations we mean that every solution of (9) is a solution of 
(3) and vice-versa.) Thus, can we choose p(t) so that p(t)(dy / dt) + 
a(t) p(t)y is the derivative of some simple expression? The answer to this 
question is yes, and is obtained by observing that 
dy ad 

“ w(t) y= w(t) — + = y, 
Hence, p(t)(dy /dt)+ a(t) u(t)y will be equal to the derivative of u(1)y if 
and only if du(t)/dt=a(t) u(t). But this is a first-order linear homoge- 
neous equation for p(t), i.e. (du/dt)— a(t) u=0 which we already know 
how to solve, and since we only need one such function p(t) we set the 
constant c in (7) equal to one and take 


a(1)=exp( fa(s)ar). 
For this p(t), Equation (9) can be written as 


£ u(t) y= n(1)0(2). (10) 


To obtain the general solution of the nonhomogeneous equation (3), that 
is, to find all solutions of the nonhomogeneous equation, we take the indef- 
inite integral (anti-derivative) of both sides of (10) which yields 


p(t)y = J w(t)b(t)dt +c 


or 
v=o I u(1)b(t)dt+c) =exp(— fa(eyat)( f w(syo(s) dt +c} (11) 


Alternately, if we are interested in the specific solution of (3) satisfying 
the initial condition y(¢9)= yo, that is, if we want to solve the initial-value 
problem 


dy 
WTI y=h(), — V(%) = Yo 


then we can take the definite integral of both sides of (10) between ¢, and ¢ 
to obtain that 


M(t) Y — W(to) Yo= J m(s)b(s)as 
Or 


y= Fi madot f H(s)0(s)a6} (12) 


Remark 1. Notice how we used our knowledge of the solution of the ho- 
mogeneous equation to find the function p(t) which enables us to solve the 
nonhomogeneous equation. This is an excellent illustration of how we use 
our knowledge of the solution of a simpler problem to solve a harder prob- 
lem. 
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Remark 2. The function aCe) =exp( J a(t) dt) is called an integrating factor 


for the nonhomogeneous equation since after multiplying both sides by 
u(t) we can immediately integrate the equation to find all solutions. 


Remark 3. The reader should not memorize formulae (11) and (12). 
Rather, we will solve all nonhomogeneous equations by first multiplying 
both sides by p(t), by writing the new left-hand side as the derivative of 
u(t) y(t), and then by integrating both sides of the equation. 


Remark 4. An alternative way of solving the initial-value problem (dy /dr) 
+ a(t) y = D(t), y(to) = Yo 18 to find the general solution (11) of (3) and then 
use the initial condition y (tp) = yp to evaluate the constant c. If the function 
u(t)b(t) cannot be integrated directly, though, then we must take the defi- 
nite integral of (10) to obtain (12), and this equation is then approximated 
numerically. 


Example 5. Find the general solution of the equation (dy /dt)—2ty =. 
Solution. Here a(t)= —2t so that 


u(1)=exp( f a(s)dt) =exp( — f 2ea)= en? 


Multiplying both sides of the equation by p(t) we obtain the equivalent 
equation 


e*(F - 20) =1e-# or fe ty=te*. 
Hence, 
ety= fiePdttc= =e" +c 
and 


y(th=—t+ce". 


Example 6. Find the solution of the initial-value problem 


Y in ,- _ 
a +2ty =t, y(1)=2. 


Solution. Here a(t)=2t so that 
u(2)=exp( fa(1)dr)=exp( f 21d) =e" 


Multiplying both sides of the equation by p(t) we obtain that 
d 


e° (F420) =e" or —(e'y) = te". 


1.2 First-order linear differential equations 


Hence, 
t t 
J sey sas f se* ds 
so that 
t yf 
5 e 
e y(s)| = > I: 
Consequently, 
2 
e'y—e= > — 5 
and 
l 3e — 72 _ 72 
yaantae f ++ie! a 


Example 7. Find the solution of the initial-value problem 


O y= y(2)=3 
dt 1+22- 


Solution. Here a(t)=1, so that 


u(1)=exp( [a(2)dt) =expl f 1dr) =e' 


Multiplying both sides of the equation by p(r) we obtain that 


(S+y)= Cor 4 oye e 
dt 1+7? dt 1+22- 


Hence 
t dad t es 
— e*y(s) ds -{ ds, 
i ds > 1+ s? 
so that 
t AY 
ey —3et— f cds 
5 1+s? 
and 
t AY 
y=e'! 302+ | c 5 ds 
, [+s 
EXERCISES 


In each of Problems 1-7 find the general solution of the given differential 
equation. 
dy 


dy - , 
1. TTY cost =0 2. Rtv! sint=0 


l 


First-order differential equations 


dy 21 I 

—_ = 4 + —_— t 
di j4p> 14P a?" 
yo. _ dy», 4 
mtn 6. ative! 
dy ! 3 


t 
ae =]— 
dt I+ 72> 1+74° 


In each of Problems 8-14, find the solution of the given initial-value prob- 


lem. 
8. Os ViFF y=0, yO=V5 9. P+ VIFF ety, y(0)=1 
10. Y Vier e y=0, y(O)=0 11. ® _oyat, y(0)=1 
dt dt 
ye \_ dy) _ 
12. a tyaite y(5)=0 13. dt Tap? y(1l)=2 
dy 
14, wv 2veh y(0)=1 
15. Find the general solution of the equation 


16. 


17. 


18. 


19. 


142) 2 ty =(42) 


(Hint: Divide both sides of the equation by 1 + 7.) 


5/2 


Find the solution of the initial-value problem 


(1+) 2 saya, y=. 


Find a continuous solution of the initial-value problem 


where 


2, O<t<l 
i=.” ; 
ay) e t>1 
Show that every solution of the equation (dy/dt)+ ay =be~“ where a and c 
are positive constants and b is any real number approaches zero as ¢ ap- 
proaches infinity. 


Given the differential equation (dy /dt)+a(t)y=f(t) with a(t) and f(t) con- 
tinuous for ~0 <t< oo, a(t)>c>0, and lim,,,, f(t)=0, show that every 
solution tends to zero as ¢ approaches infinity. 


When we derived the solution of the nonhomogeneous equation we tacitly 
assumed that the functions a(t) and b(t) were continuous so that we could 
perform the necessary integrations. If either of these functions was discon- 
tinuous at a point ¢,, then we would expect that our solutions might be dis- 
continuous at ¢=¢,. Problems 20-23 illustrate the variety of things that 
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may happen. In Problems 20-22 determine the behavior of all solutions of 
the given differential equation as t->0, and in Problem 23 determine the 
behavior of all solutions as t>7/2. 


20. atp 2p 21. ay; ° 
di. _ sin t dy _. 
22. Gt py = Oost t+ 23. a ty tant=sinécost. 


1.3 The Van Meegeren art forgeries 


After the liberation of Belgium in World War II, the Dutch Field Security 
began its hunt for Nazi collaborators. They discovered, in the records of a 
firm which had sold numerous works of art to the Germans, the name of a 
banker who had acted as an intermediary in the sale to Goering of the 
painting “Woman Taken in Adultery” by the famed 17th century Dutch 
painter Jan Vermeer. The banker in turn revealed that he was acting on 
behalf of a third rate Dutch painter H. A. Van Meegeren, and on May 239, 
1945 Van Meegeren was arrested on the charge of collaborating with the 
enemy. On July 12, 1945 Van Meegeren startled the world by announcing 
from his prison cell that he had never sold ““Woman Taken in Adultery” to 
Goering. Moreover, he stated that this painting and the very famous and 
beautiful “Disciples at Emmaus”, as well as four other presumed Vermeers 
and two de Hooghs (a 17th century Dutch painter) were his own work. 
Many people, however, thought that Van Meegeren was only lying to save 
himself from the charge of treason. To prove his point, Van Meegeren be- 
gan, while in prison, to forge the Vermeer painting “Jesus Amongst the 
Doctors” to demonstrate to the skeptics just how good a forger of Vermeer 
he was. The work was nearly completed when Van Meegeren learned that 
a charge of forgery had been substituted for that of collaboration. He, 
therefore, refused to finish and age the painting so that hopefully investiga- 
tors would not uncover his secret of aging his forgeries. To settle the ques- 
tion an international panel of distinguished chemists, physicists and art 
historians was appointed to investigate the matter. The panel took x-rays 
of the paintings to determine whether other paintings were underneath 
them. In addition, they analyzed the pigments (coloring materials) used in 
the paint, and examined the paintings for certain signs of old age. 

Now, Van Meegeren was well aware of these methods. To avoid detec- 
tion, he scraped the paint from old paintings that were not worth much, 
Just to get the canvas, and he tried to use pigments that Vermeer would 
have used. Van Meegeren also knew that old paint was extremely hard, 
and impossible to dissolve. Therefore, he very cleverly mixed a chemical, 
phenoformaldehyde, into the paint, and this hardened into bakelite when 
the finished painting was heated in an oven. 
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However, Van Meegeren was careless with several of his forgeries, and 
the panel of experts found traces of the modern pigment cobalt blue. In 
addition, they also detected the phenoformaldehyde, which was not dis- 
covered until the turn of the 19th century, in several of the paintings. On 
the basis of this evidence Van Meegeren was convicted, of forgery, on Oc- 
tober 12, 1947 and sentenced to one year in prison. While in prison he 
suffered a heart attack and died on December 30, 1947. 

However, even following the evidence gathered by the panel of experts, 
many people still refused to believe that the famed “Disciples at Emmaus” 
was forged by Van Meegeren. Their contention was based on the fact that 
the other alleged forgeries and Van Meegeren’s nearly completed “Jesus 
Amongst the Doctors” were of a very inferior quality. Surely, they said, the 
creator of the beautiful “Disciples at Emmaus” could not produce such in- 
ferior pictures. Indeed, the “Disciples at Emmaus” was certified as an 
authentic Vermeer by the noted art historian A. Bredius and was bought 
by the Rembrandt Society for $170,000. The answer of the panel to these 
skeptics was that because Van Meegeren was keenly disappointed by his 
lack of status in the art world, he worked on the “Disciples at Emmaus” 
with the fierce determination of proving that he was better than a third 
rate painter. After producing such a masterpiece his determination was 
gone. Moreover, after seeing how easy it was to dispose of the “Disciples at 
Emmaus” he devoted less effort to his subsequent forgeries. This explana- 
tion failed to satisfy the skeptics. They demanded a thoroughly scientific 
and conclusive proof that the “Disciples at Emmaus” was indeed a forgery. 
This was done recently in 1967 by scientists at Carnegie Mellon University, 
and we would now like to describe their work. 

The key to the dating of paintings and other materials such as rocks and 
fossils lies in the phenomenon of radioactivity discovered at the turn of the 
century. The physicist Rutherford and his colleagues showed that the 
atoms of certain “radioactive” elements are unstable and that within a 
given time period a fixed proportion of the atoms spontaneously disin- 
tegrates to form atoms of a new element. Because radioactivity is a prop- 
erty of the atom, Rutherford showed that the radioactivity of a substance 
is directly proportional to the number of atoms of the substance present. 
Thus, if M(t) denotes the number of atoms present at time ¢, then dN / dt, 
the number of atoms that disintegrate per unit time is proportional to N, 
that is, 

dN _ 

dt — 
The constant A which is positive, is known as the decay constant of the 
substance. The larger A is, of course, the faster the substance decays. One 
measure of the rate of disintegration of a substance is its half-life which is 
defined as the time required for half of a given quantity of radioactive 
atoms to decay. To compute the half-life of a substance in terms of A, 
assume that at time f), N(t))= Np. Then, the solution of the initial-value 


—)N. (1) 
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problem dN /dt= —AXN, N (to) = No is 
t 
N (1)= Noexo(—A f is) = mye 
Lo 


or N/N,=exp(—A(t— ty)). Taking logarithms of both sides we obtain that 


= jn 
A(t — to) =In No. (2) 
Now, if N/N j= 3 then —A(t—%))=In} so that 
In2 0.6931 
(1 b)= ==. (3) 


Thus, the half-life of a substance is In2 divided by the decay constant A. 
The dimension of A, which we suppress for simplicity of writing, is recipro- 
cal time. If ¢ is measured in years then A has the dimension of reciprocal 
years, and if ¢ is measured in minutes, then A has the dimension of recipro- 
cal minutes. The half-lives of many substances have been determined and 
recorded. For example, the half-life of carbon-14 is 5568 years and the 
half-life of uranium-238 is 4.5 billion years. 

Now the basis of “radioactive dating” is essentially the following. From 
Equation (2) we can solve for t—t)=1/Aln(N,/N). If fo is the time the 
substance was initially formed or manufactured, then the age of the sub- 
stance is 1/Aln(N,/N). The decay constant A is known or can be com- 
puted, in most instances. Moreover, we can usually evaluate N quite easily. 
Thus, if we knew Ny we could determine the age of the substance. But this 
is the real difficulty of course, since we usually do not know Np. In some 
instances though, we can either determine N, indirectly, or else determine 
certain suitable ranges for No, and such is the case for the forgeries of Van 
Meegeren. 

We begin with the following well-known facts of elementary chemistry. 
Almost all rocks in the earth’s crust contain a small quantity of uranium. 
The uranium in the rock decays to another radioactive element, and that 
one decays to another and another, and so forth (see Figure 1) in a series 
of elements that results in lead, which is not radioactive. The uranium 
(whose half-life is over four billion years) keeps feeding the elements 
following it in the series, so that as fast as they decay, they are replaced by 
the elements before them. 

Now, all paintings contain a small amount of the radioactive element 
lead-210 (7!°Pb), and an even smaller amount of radium-226 (776Ra), since 
these elements are contained in white lead (lead oxide), which is a pigment 
that artists have used for over 2000 years. For the analysis which follows, it 
is important to note that white lead is made from lead metal, which, in 
turn, is extracted from a rock called lead ore, in a process called smelting. 
In this process, the lead-210 in the ore goes along with the lead metal. 
However, 90-95% of the radium and its descendants are removed with 
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other waste products in a material called slag. Thus, most of the supply of 
lead-210 is cut off and it begins to decay very rapidly, with a half-life of 22 
years. This process continues until the lead-210 in the white lead is once 
more in radioactive equilibrium with the small amount of radium present, 
i.e. the disintegration of the lead-210 is exactly balanced by the disintegra- 
tion of the radium. 

Let us now use this information to compute the amount of lead-210 pre- 
sent in a sample in terms of the amount originally present at the time of 
manufacture. Let y(t) be the amount of lead-210 per gram of white lead 
at time ¢t, yy the amount of lead-210 per gram of white lead present at 
the time of manufacture ¢), and r(¢) the number of disintegrations of 
radium-226 per minute per gram of white lead, at time ¢. If A is the decay 
constant for lead-210, then 


Bab tr(), y(t) =o (4) 


Since we are only interested in a time period of at most 300 years we may 
assume that the radium-226, whose half-life is 1600 years, remains con- 
stant, so that r(t) is a constant r. Multiplying both sides of the differential 
equation by the integrating factor u(t)=e™ we obtain that 


da AL, — p» At 
ne ate 
Hence 
ery ( t) _ eMoy, = x ( eM — ero) 
Or 


y()=F (eM) + ye MEW, (5) 


Now y(t) and r can be easily measured. Thus, if we knew y, we could 
use Equation (5) to compute (¢— 7) and consequently, we could determine 
the age of the painting. As we pointed out, though, we cannot measure y, 
directly. One possible way out of this difficulty is to use the fact that the 
original quantity of lead-210 was in radioactive equilibrium with the larger 
amount of radium-226 in the ore from which the metal was extracted. Let 
us, therefore, take samples of different ores and count the number of disin- 
tegrations of the radium-226 in the ores. This was done for a variety of 
ores and the results are given in Table 1 below. These numbers vary from 
0.18 to 140. Consequently, the number of disintegrations of the lead-210 
per minute per gram of white lead at the time of manufacture will vary 
from 0.18 to 140. This implies that yy will also vary over a very large inter- 
val, since the number of disintegrations of lead-210 is proportional to the 
amount present. Thus, we cannot use Equation (5) to obtain an accurate, 
or even a crude estimate, of the age of a painting. 
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Table 1. Ore and ore concentrate samples. All disintegration rates 
are per gram of white lead. 


Description and Source Disintegrations per minute of ?7°Ra 
Ore concentrate §(Oklahoma—Kansas) 4.5 
Crushed raw ore (S.E. Missouri) 2.4 
Ore concentrate  (S.E. Missouri) 0.7 
Ore concentrate (Idaho) 2.2 
Ore concentrate (Idaho) 0.18 
Ore concentrate | (Washington) 140.0 
Ore concentrate (British Columbia) 1.9 
Ore concentrate (British Columbia) 0.4 
Ore concentrate (Bolivia) 1.6 
Ore concentrate (Australia) 1.1 


However, we can still use Equation (5) to distinguish between a 17th 
century painting and a modern forgery. The basis for this statement is the 
simple observation that if the paint is very old compared to the 22 year 
half-life of lead, then the amount of radioactivity from the lead-210 in the 
paint will be nearly equal to the amount of radioactivity from the radium 
in the paint. On the other hand, if the painting is modern (approximately 
20 years old, or so) then the amount of radioactivity from the lead-210 will 
be much greater than the amount of radioactivity from the radium. 

We make this argument precise in the following manner. Let us assume 
that the painting in question is-either very new or about 300 years old. Set 
t— t)=300 years in (5). Then, after some simple algebra, we see that 


AYo = Ay (t)e70™ — r(e2™ — 1). (6) 


If the painting is indeed a modern forgery, then Ay, will be absurdly 
large. To determine what is an absurdly high disintegration rate we observe 
(see Exercise 1) that if the lead-210 decayed originally (at the time of 
manufacture) at the rate of 100 disintegrations per minute per gram of 
white lead, then the ore from which it was extracted had a uranium con- 
tent of approximately 0.014 per cent. This is a fairly high concentration of 
uranium since the average amount of uranium in rocks of the earth’s crust 
is about 2.7 parts per million. On the other hand, there are some very rare 
ores in the Western Hemisphere whose uranium content is 2-3 per cent. To 
be on the safe side, we will say that a disintegration rate of lead-210 is cer- 
tainly absurd if it exceeds 30,000 disintegrations per minute per gram of 
white lead. 

To evaluate Ayp, we must evaluate the present disintegration rate, Ay(1), 
of the lead-210, the disintegration rate r of the radium-226, and e?™. Since 
the disintegration rate of polonium-210 (?!°Po) equals that of lead-210 after 
several years, and since it is easier to measure the disintegration rate of 
polonium-210, we substitute these values for those of lead-210. To compute 


16 


1.3 The Van Meergeren art forgeries 


e300 we observe from (3) that \=(In2/22). Hence 


e300 _ e (300/22)In2 _ 9(150/1 1). 


The disintegration rates of polonium-210 and radium-226 were measured 
for the “Disciples at Emmaus” and various other alleged forgeries and are 
given in Table 2 below. 


Table 2. Paintings of questioned authorship. All disintegration 
rates are per minute, per gram of white lead. 


210Po disintegration 


Description 26Ra disintegration 


“Disciples at Emmaus” 0.8 
“Washing of Feet” 0.26 
“Woman Reading Music” 0.3 
“Woman Playing Mandolin” 0.17 
‘Lace Maker” 1.4 


“Laughing Girl” 


If we now evaluate Ay, from (6) for the white lead in the painting “Disci- 
ples at Emmaus” we obtain that 


AYo= (8.5)2 10/1 _ 0.8(2 1SO/11 _ 1) 
= 98,050 


which is unacceptably large. Thus, this painting must be a modern forgery. 
By a similar analysis, (see Exercises 2-4) the paintings “Washing of Feet”, 
“Woman Reading Music” and “Woman Playing Mandolin” were indisput- 
ably shown to be faked Vermeers. On the other hand, the paintings “Lace 
Maker” and “Laughing Girl” cannot be recently forged Vermeers, as 
claimed by some experts, since for these two paintings, the polonium-210 is 
very nearly in radioactive equilibrium with the radium-226, and no such 
equilibrium has been observed in any samples from 19th or 20th century 
paintings. 
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EXERCISES 


1. In this exercise we show how to compute the concentration of uranium in an 
ore from the dpm/(g of Pb) of the lead-210 in the ore. 

(a) The half-life of uranium-238 is 4.51 x 10? years. Since this half-life is so 
large, we may assume that the amount of uranium in the ore is constant 
over a period of two to three hundred years. Let N (t) denote the number of 
atoms of 738U per gram of ordinary lead in the ore at time ¢. Since the 
lead-210 is in radioactive equilibrium with the uranium-238 in the ore, we 
know that dN /dt= —AN=-— 100 dpm/g of Pb at time ¢o. Show that there 
are 3.4210!” atoms of uranium-238 per gram of ordinary lead in the ore 
at time fp. (Hint: 1 year=525,600 minutes.) 

(b) Using the fact that one mole of uranium-238 weighs 238 grams, and that 
there are 6.02 x 1073 atoms in a mole, show that the concentration of 
uranium in the ore is approximately 0.014 percent. 


For each of the paintings 2, 3, and 4 use the data in Table 2 to compute 
the disintegrations per minute of the original amount of lead-210 per gram 
of white lead, and conclude that each of these paintings is a forged 
Vermeer. 


2. “Washing of Feet” 
3. “Woman Reading Music” 
4. “Woman Playing Mandolin” 


5. The following problem describes a very accurate derivation of the age of 
uranium. 
(a) Let N3g(¢) and N.3,(t) denote the number of atoms of 778U and 7°U at 
time ¢ in a given sample of uranium, and let t=0 be the time this sample 
was created. By the radioactive decay law, 


_—in2_ 

= N38 (t) = 45)10 N38 (t), 
n2 

giNas (= oo N35 (t). 


Solve these equations for N3.(¢) and N.3,(f) in terms of their original num- 
bers N53,(0) and N23<(0). 

(b) In 1946 the ratio of 78U/?3°U in any sample was 137.8. Assuming that 
equal amounts of 7°U and 735U appeared in any sample at the time of its 
creation, show that the age of uranium is 5.96 Xx 10° years. This figure is 
universally accepted as the age of uranium. 


6. In a samarskite sample discovered recently, there was 3 grams of Thorium 
(272TH). Thorium decays to lead-208 (7°8Pb) through the reaction 737Th->?Pb 
+ 6(44He). It was determined that 0.0376 of a gram of lead-208 was produced 
by the disintegration of the original Thorium in the sample. Given that the 
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half-life of Thorium is 13.9 billion years, derive the age of this samarskite sam- 
ple. (Hint: 0.0376 grams of ?°8Pb is the product of the decay of (232/208) x 
0.0376 grams of Thorium.) 

One of the most accurate ways of dating archaeological finds is the method 
of carbon-14 ('4C) dating discovered by Willard Libby around 1949. The basis 
of this method is delightfully simple: The atmosphere of the earth is continu- 
ously bombarded by cosmic rays. These cosmic rays produce neutrons in the 
earth’s atmosphere, and these neutrons combine with nitrogen to produce !4C, 
which is usually called radiocarbon, since it decays radioactively. Now, this ra- 
diocarbon is incorporated in carbon dioxide and thus moves through the atmo- 
sphere to be absorbed by plants. Animals, in turn, build radiocarbon into their 
tissues by eating the plants. In living tissue, the rate of ingestion of '*C exactly 
balances the rate of disintegration of '*C. When an organism dies, though, it 
ceases to ingest carbon-14 and thus its '*C concentration begins to decrease 
through disintegration of the '*C present. Now, it is a fundamental assumption 
of physics that the rate of bombardment of the earth’s atmosphere by cosmic 
rays has always been constant. This implies that the original rate of disintegra- 
tion of the '*C in a sample such as charcoal is the same as the rate measured 
today.* This assumption enables us to determine the age of a sample of char- 
coal. Let N(t) denote the amount of carbon-14 present in a sample at time /, 
and N, the amount present at time t=0 when the sample was formed. If A de- 
notes the decay constant of '4C (the half-life of carbon-14 is 5568 years) then 
dN (t)/ dt = —AN (t), N(0)= No. Consequently, N (t)= Noe~™’. Now the present 
rate R(t) of disintegration of the '*C in the sample is given by R(t)=AN (t)= 
ANoe~™ and the original rate of disintegration is R(0)=ANp. Thus, R(t) / R(0) 
=e so that r=(1/A)In[R(0)/R(t)]. Hence if we measure R(t), the present 
rate of disintegration of the '4C in the charcoal, and observe that R(0) must 
equal the rate of disintegration of the '*C in a comparable amount of living 
wood, then we can compute the age ¢ of the charcoal. The following two prob- 
lems are real life illustrations of this method. 


7. Charcoal from the occupation level of the famous Lascaux Cave in France 
gave an average count in 1950 of 0.97 disintegrations per minute per gram. 
Living wood gave 6.68 disintegrations. Estimate the date of occupation and 
hence the probable date of the remarkable paintings in the Lascaux Cave. 


8. In the 1950 excavation at Nippur, a city of Babylonia, charcoal from a roof 
beam gave a count of 4.09 disintegrations per minute per gram. Living wood 
gave 6.68 disintegrations. Assuming that this charcoal was formed during the 
time of Hammurabi’s reign, find an estimate for the likely time of Hamurabi’s 
succession. 


*Since the mid 1950s the testing of nuclear weapons has significantly increased the amount of 
radioactive carbon in our atmosphere. Ironically this unfortunate state of affairs provides us 
with yet another extremely powerful method of detecting art forgeries. To wit, many artists’ 
materials, such as linseed oil and canvas paper, come from plants and animals, and so will 
contain the same concentration of carbon-14 as the atmosphere at the time the plant or 
animal dies. Thus linseed oil (which is derived from the flax plant) that was produced during 
the last few years will contain a much greater concentration of carbon-14 than linseed oil pro- 
duced before 1950. 
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1.4 Separable equations 


We solved the first-order linear homogeneous equation 


d 
— +a(t)y=0 (1) 


by dividing both sides of the equation by y(¢) to obtain the equivalent 
equation 


dy(t 
1 M(t) _ 

y ( t) dt =—~a (t) (2) 
and observing that Equation (2) can be written in the form 

d on 

£ in|y(t)|=— (0). (3) 


We then found In| y(2)|, and consequently y(r), by integrating both sides of 
(3). In an exactly analogous manner, we can solve the more general dif- 
ferential equation 


a _ 0) 
dt f(y) 


where f and g are continuous functions of y and ¢. This equation, and any 
other equation which can be put into this form, is said to be separable. To 
solve (4), we first multiply both sides by f(y) to obtain the equivalent 
equation 


(4) 


dy 
f(y) =a(0). (5) 
Then, we observe that (5) can be written in the form 
d 
ary Masa (6) 


where F(y) is any anti-derivative of f(y); ie, F(y)= J f(y)dy. Conse- 
quently, 


F(x())= f g(dar+e (7) 
where c is an arbitrary constant of integration, and we solve for y=y(f) 


from (7) to find the general solution of (4). 


Example 1. Find the general solution of the equation dy /dt = 1?/y?. 
Solution. Multiplying both sides of this equation by y? gives 


dy fa y*(t) 
2 = 72 — = 2 
Yat a - 


Hence, y*(¢)= 73+ c where c is an arbitrary constant, and y(t)=(t?+c)!/. 
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Example 2. Find the general solution of the equation 


dy 
e?— —1- 2 =0. 
at 
Solution. This equation can be written in the form 


fr =t+r 


and thus e”® = 1¢7/2+ t*/4+c. Taking logarithms of both sides of this 
equation gives y(t)=In(t?/2+ t4/4+c). 


In addition to the differential equation (4), we will often impose an ini- 
tial condition on y(t) of the form y(¢j)=yo. The differential equation (4) 
together with the initial condition y(t9)= yo is called an initial-value prob- 
lem. We can solve an initial-value problem two different ways. Either we 
use the initial condition y(¢))= yp» to solve for the constant c in (7), or else 
we integrate both sides ot (6) between f, and ¢ to obtain that 


F(y())-Fo)= f "a(s) ds. (8) 
If we now observe that 
F(y)—F(y)= f° f(r)ar, (9) 


then we can rewrite (8) in the simpler form 


[Fo ar= J a(s)as (10) 


Example 3. Find the solution y(¢) of the initial-value problem 


dy 
5 rn 3) — = 
er (t+¢°?)=0, y()=1. 
Solution. Method (1). From Example 2, we know that the general solution 
of this equation is y=In(t*/2+ 14/4+c). Setting t=1 and y=1 gives 1= 
In(3/4+c), or c=e—3/4. Hence, y(t)=In(e—3/4+4+ 17/24 14/4). 

Method (11). From (10), 


[retdr= fst s?) ds. 


Consequently, 
ent vet l ang y(t)=In(e—3/44 0°/2+ 14/4). 
2 4 2 4° 


Example 4. Solve the initial-value problem dy /dt=1+ 7’, y(0)=0. 
Solution. Divide both sides of the differential equation by 1+, to obtain 
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the equivalent equation 1/(1+y7)dy /dt=1. Then, from (10) 


Yar t 
——— = | ds. 
| 1+r? i 


Consequently, arctany =f, and y =tant. 

The solution y = tan? of the above problem has the disturbing property 
that it goes to too at t= +7/2. And what’s even more disturbing is the 
fact that there is nothing at all in this initial-value problem which even 
hints to us that there is any trouble at t= + 7/2. The sad fact of life is that 
solutions of perfectly nice differential equations can go to infinity in finite 
time. Thus, solutions will usually exist only on a finite open interval a<t< 
b, rather than for all time. Moreover, as the following example shows, dif- 
ferent solutions of the same differential equation usually go to infinity at 
different times. 


Example 5. Solve the initial-value problem dy /dt=1+y?, y(0)=1. 


Solution. From (10) 
y dr t 
—— =| ds. 
J 1+r? J 


Consequently, arctany —arctanl=¢, and y=tan(¢+7/4). This solution 
exists on the open interval —37/4<t<7/4. 


Example 6. Find the solution y(7) of the initial-value problem 
d 
y= +(1+y?)sint=0, yp (0)=1. 


Solution. Dividing both sides of the differential equation by 1+? gives 


» y = —sint 
l+y? dt 
Consequently, 
y 
[ rar _ f"_ sins ds, 
, itr? Yo 
so that 


sIn(1 +?) — 31n2=cost—1. 
Solving this equation for y(t) gives 
y(t) = + (2e~ 4sin’s/2 _ 1), 


To determine whether we take the plus or minus branch of the square root, 
we note that y(0) is positive. Hence, 
y (t) — (2e~ 4sin’e/2 _ 1” 
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This solution is only defined when 
Qe~4sin’t/25 1 
or 
eAsin’s/2 < 9. (11) 
Since the logarithm function is monotonic increasing, we may take loga- 


rithms of both sides of (11) and still preserve the inequality. Thus, 4sin?z/2 
<1n2, which implies that 


Vin2 
2 


t . 
[| <aresin 


Therefore, y(t) only exists on the open interval (— a,a) where 
a=2arcsin| Vin2 /2]. 

Now, this appears to be a new difficulty associated with nonlinear equa- 

tions, since y(f) just “disappears” at t= +a, without going to infinity. 

However, this apparent difficulty can be explained quite easily, and more- 


over, can even be anticipated, if we rewrite the differential equation above 
in the standard form 


dy (1+ y?)sin¢ 

a 
Notice that this differential equation is not defined when y =0. Therefore, 
if a solution y(t) achieves the value zero at some time t= /*, then we 


cannot expect it to be defined for ¢ > ¢*. This is exactly what happens here, 
since y(+a)=0. 


Example 7. Solve the initial-value problem dy /dt=(1+,y)t, y(O)=—1. 

Solution. In this case, we cannot divide both sides of the differential equa- 
tion by 1+, since y(0)= — 1. However, it is easily seen that y(t)= —1 is 
one solution of this initial-value problem, and in Section 1.10 we show that 
it is the only solution. More generally, consider the initial-value problem 


dy / dt = f(y) g(t), Y(to)=Yo, where f(y) =0. Certainly, y(t)= yo is one 
solution of this initial-value problem, and in Section 1.10 we show that it is 
the only solution if df/dy exists and is continuous. 


Example 8. Solve the initial-value problem 
(l+e”)dy/dt=cost,  y(a/2)=3. 
Solution. From (10), 


t 
fa + e")dr= f cos s ds 
3 a /2 
so that y+ e” =2+ e?+sinz. This equation cannot be solved explicitly for y 
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as a function of ¢. Indeed, most separable equations cannot be solved ex- 
plicitly for y as a function of ¢. Thus, when we say that 


yte’=2+e%+sint 


is the solution of this initial-value problem, we really mean that it is an im- 
plicit, rather than an explicit solution. This does not present us with any 
difficulties in applications, since we can always find y(t) numerically with 
the aid of a digital computer (see Section 1.11). 


Example 9. Find all solutions of the differential equation dy/dt=—t/y. 
Solution. Multiplying both sides of the differential equation by y gives 
ydy/dt=—t. Hence 


yet = c?%, (12) 


Now, the curves (12) are closed, and we cannot solve for y as a single-val- 
ued function of t. The reason for this difficulty, of course, is that the dif- 
ferential equation is not defined when y =0. Nevertheless, the circles 47+ y? 
=c* are perfectly well defined, even when y=0. Thus, we will call the 


circles 7+ y?=c? solution curves of the differential equation 
dy /dt=-—t/y. 


More generally, we will say that any curve defined by (7) is a solution 
curve of (4). 


EXERCISES 


In each of Problems 1-5, find the general solution of the given differential 
equation. 


tan x + tany 


d 
1. (1+ P)— =1+y?. Hint: tan(x+y)= Tan yiany: 


dy dy 2 2 
2. a tod +y) 3. ait tty ty 
dW ray 43 _ dD _. 
4, an? 5. cosy sint 7 =siny cost 


In each of Problems 6-12, solve the given initial-value problem, and de- 
termine the interval of existence of each solution. 


d 
6. P(L+y?)+2y =0, y(0)=1 


dy 21 
7, = =—L_, 2) 53 
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dy _ 
+P = yt Py, yO)=l 


dy _317+41+2 


. > = __, y(do=- | 
* i dy-th 7? 
dy _ —tsiny _ 
10. cosy = lap” yd)=7/2 
dy 
11. a Kla- yb —-y), y(0)=0, a,b >0 


12. 


13. 


14, 


15. 


19, 


31 =ycost, y(1)=0 
Any differential equation of the form dy /dt=f(y) is separable. Thus, we can 
solve all those first-order differential equations in which time does not appear 
explicitly. Now, suppose we have a differential equation of the form dy/dt= 
f(y/t), such as, for example, the equation dy/dt=sin(y/t). Differential equa- 
tions of this form are called homogeneous equations. Since the right-hand side 
only depends on the single variable y/t, it suggests itself to make the substitu- 
tion y/t=v or y=tv. 

(a) Show that this substitution replaces the equation dy/dt=f(y/t) by the 

equivalent equation tdv/dt+v=f(v), which is separable. 
(b) Find the general solution of the equation dy /dt =2(y/t)+(y/0”. 


Determine whether each of the following functions of tf and y can be expressed 
as a function of the single variable y/t. 


2 34 3 

y+2y yori 
(a) —— (b) —— 

y yt°+y 

34 73 

yr tt tty 
c) ——— d) Iny —In¢+ ——~ 
©) a3 (d) Iny = 

eity ee 
(e) = (f) InVvr+y —InVr—-y 

tty (2471 $9y2) 7 
(g) sin t=y ( S145 


Solve the initial-value problem t(dy/dr)=y+ Vt?+y? . y(1)=0. 


Find the general solution of the following differential equations. 


yi, » dy 
. Wy =3y?=1 17. (¢— Vy )> =y 
dy tty 

dt t—y 


d 
e (y= +y(1+e'/”’)=0 


>— |] 
Hint: [een 1+ vel/° 
| ve '/° + 2 ( ) 
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20. Consider the differential equation 


dy _ttytl 


= * 
dt t-yt+30 (*) 


We could solve this equation if the constants 1 and 3 were not present. To 

eliminate these constants, we make the substitution f= 7+h, y= Y+k. 

(a) Determine h and k so that (*) can be written in the form dY/dT= 
(T+ Y)/(T — Y). 

(b) Find the general solution of (*). (See Exercise 18). 


21. (a) Prove that the differential equation 


dy _at+by+m 
dt ct+dyt+n 


where a, b, c, d, m, and n are constants, can always be reduced to dy/dt= 
(at + by)/(ct + dy) if ad— bc #0. 
(b) Solve the above equation in the special case that ad= bc. 


Find the general solution of the following equations. 
22. (l+1—2y)+(41—3y —6)dy/dt=0 
23. (t+2y +3)+(21+4y —l)dy/dt=0 


1.5 Population models 


In this section we will study first-order differential equations which govern 
the growth of various species. At first glance it would seem impossible to 
model the growth of a species by a differential equation since the popula- 
tion of any species always changes by integer amounts. Hence the popula- 
tion of any species can never be a differentiable function of time. How- 
ever, if a given population is very large and it is suddenly increased by one, 
then the change is very small compared to the given population. Thus, we 
make the approximation that large populations change continuously and 
even differentiably with time. 

Let p(t) denote the population of a given species at time ¢ and let r(t,p) 
denote the difference between its birth rate and its death rate. If this 
population is isolated, that is, there is no net immigration or emigration, 
then dp/dt, the rate of change of the population, equals rp(t). In the most 
simplistic model we assume that r is constant, that is, it does not change 
with either time or population. Then, we can write down the following dif- 
ferential equation governing population growth: 

dp (t) 


ae 4 (t), | a=constant. 


26 


1.5 Population models 


This is a linear equation and 1s known as the Malthusian law of population 
growth. If the population of the given species is pp) at time fo, then p(?) 
satisfies the initial-value problem dp(t)/dt=ap(t), p(t) =o. The solution 
of this initial-value problem is p(t) =p e*“~. Hence any species satisfying 
the Malthusian law of population growth grows exponentially with time. 

Now, we have just formulated a very simple model for population 
growth; so simple, in fact, that we have been able to solve it completely in a 
few lines. It is important, therefore, to see if this model, with its simplicity, 
has any relationship at all with reality. Let p(t) denote the human popula- 
tion of the earth at time ¢. It was estimated that the earth’s human 
population was increasing at an average rate of 2% per year during the 
period 1960-1970. Let us start in the middle of this decade on January 1, 
1965, at which time the U.S. Department of Commerce estimated the earth’s 
population to be 3.34 billion people. Then, t, =1965, py = 3.3410? and 
a =.02, so that 


p(t) — (3.34)10°e 02(t— 1965) _ 


One way of checking the accuracy of this formula is to compute the time 
required for the population of the earth to double, and then compare it to 
the observed value of 35 years. Our formula predicts that the population of 
the earth will double every T years, where 


e 2T—). 


Taking logarithms of both sides of this equation gives .027 = 1n2, so that 
T = 501n2 = 34.6 years. 


This is in excellent agreement with the observed value. On the other hand, 
though, let us look into the distant future. Our equation predicts that the 
earth’s population will be 200,000 billion in the year 2515, 1,800,000 billion 
in the year 2625, and 3,600,000 billion in the year 2660. These are astro- 
nomical numbers whose significance is difficult to gauge. The total surface 
of this planet is approximately 1,860,000 billion square feet. Eighty percent 
of this surface is covered by water. Assuming that we are willing to live on 
boats as well as land, it is easy to see that by the year 2515 there will be only 
9.3 square feet per person; by 2625 each person will have only one square 
foot on which to stand; and by 2660 we will be standing two deep on each 
other’s shoulders. 

It would seem therefore, that this model is unreasonable and should be 
thrown out. However, we cannot ignore the fact that it offers exceptional 
agreement in the past. Moreover, we have additional evidence that popula- 
tions do grow exponentially. Consider the Microtus Arvallis Pall, a small 
rodent which reproduces very rapidly. We take the unit of time to be a 
month, and assume that the population is increasing at the rate of 40% per 


27 


1 First-order differential equations 


month. If there are two rodents present initially at time ¢=0, then p(t), the 
number of rodents at time ¢, satisfies the initial-value problem 


dp(t)/dt=0.4p, ~—_ p(0) =2. 
Consequently, 


p(t)=2e°. (1) 


Table 1 compares the observed population with the population calculated 
from Equation (1). 


Table 1. The growth of Ts Arvallis Pall. 
Months 


p Observed 
p Calculated 


As one can see, there is excellent agreement. 


Remark. In the case of the Microtus Arvallis Pall, p observed is very ac- 
curate since the pregnancy period is three weeks and the time required for 
the census taking is considerably less. If the pregnancy period were very 
short then p observed could not be accurate since many of the pregnant ro- 
dents would have given birth before the census was completed. 


The way out of our dilemma is to observe that linear models for popula- 
tion growth are satisfactory as long as the population is not too large. 
When the population gets extremely large though, these models cannot be 
very accurate, since they do not reflect the fact that individual members 
are now competing with each other for the limited living space, natural re- 
sources and food available. Thus, we must add a competition term to our 
linear differential equation. A suitable choice of a competition term is 
— bp*, where b is a constant, since the statistical average of the number of 
encounters of two members per unit time is proportional to p?. We con- 
sider, therefore, the modified equation 


dp 5 
HPP . 


This equation is known as the logistic law of population growth and the 
numbers a,b are called the vital coefficients of the population. It was first 
introduced in 1837 by the Dutch mathematical-biologist Verhulst. Now, 
the constant b, in general, will be very small compared to a, so that if p is 
not too large then the term — bp? will be negligible compared to ap and the 
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population will grow exponentially. However, when p is very large, the 
term — bp? is no longer negligible, and thus serves to slow down the rapid 
rate of increase of the population. Needless to say, the more industrialized 
a nation is, the more living space it has, and the more food it has, the 
smaller the coefficient 5 1s. 

Let us now use the logistic equation to predict the future growth of an 
isolated population. If p, is the population at time fo, then p(t), the popula- 
tion at time ¢, satisfies the initial-value problem 


ap 
a — bp’, — P(t) = Po. 


This is a separable differential equation, and from Equation (10), Section 


1.4, 
= | ds=(t-t 
Ja 7 bP f ° 


To integrate the function 1/(ar— br?) we resort to partial fractions. Let 
| = 1 1A, B 
ar—br? r(a—br) r  a-br 
To find A and B, observe that 
A A Bo A(a-— br)+ Br _ Aa+(B-—bDA)r 
r  a-br r(a— br) r(a— br) 


Therefore, Aa+(B— bA)r=1. Since this equation is true for all values of r, 
we see that Aa=1 and B— bA =0. Consequently, A=1/a, B=b/a, and 


Pdr _ il ‘(4+ b Jar 
r(a—br) a r  a-obr 
Po Po 


a-—b a—b 
a} Po a—bp a Po| a— bp 
Thus, 
a(t=t))=In2. | 2 P (2) 
° Po| 4- 4p 


Now, it is a simple matter to show (see Exercise 1) that 


a— bpy 
a— bp(t) 
is always positive. Hence, 
a — bp 
t—t)=1 
a( o) ne a- a — bp’ 
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Taking exponentials of both sides of this equation gives 
eat to) = Pe a= Po 
Po a—bp 
or 
Po(a— bp)e*'~) = (a — bpg) p. 
Bringing all terms involving p to the left-hand side of this equation, we see 
that 
| a~— bp) + bpye*'~ p(t)= ape t\!~ ©, 
Consequently, 


ap yet! to) aD o 


P(t)= (3) 


a—bpyt bpye*'- — bpg + (a— bpp)e~ 8-1 
Let us now examine Equation (3) to see what kind of population it pre- 
dicts. Observe that as [> 0, 


aPo 
tho— = 
P(N FF 


ES) 


Thus, regardless of its initial value, the population always approaches the 
limiting value a/b. Next, observe that p(t) is a monotonically increasing 
function of time if 0<py<a/b. Moreover, since 


d*p dp dp 
We =a — 2bp— =(a—2bp) p(a— bp), 


we see that dp/dt is increasing if p(t)<.a/2b, and that dp /dt is decreasing 
if p(t)>a/2b. Hence, if pyp<a/2b, the graph of p(t) must have the form 
given in Figure 1. Such a curve is called a logistic, or S-shaped curve. 
From its shape we conclude that the time period before the population 
reaches half its limiting value is a period of accelerated growth. After this 
point, the rate of growth decreases and in time reaches zero. This is a 
period of diminishing growth. 

These predictions are borne out by an experiment on the protozoa 
Paramecium caudatum performed by the mathematical biologist G. F. 
Gause. Five individuals of Paramecium were placed in a small test tube 
containing 0.5 cm? of a nutritive medium, and for six days the number of 
individuals in every tube was counted daily. The Paramecium were found 
to increase at a rate of 230.9% per day when their numbers were low. The 
number of individuals increased rapidly at first, and then more slowly, un- 
til towards the fourth day it attained a maximum level of 375, saturating 
the test tube. From this data we conclude that if the Paramecium cauda- 
tum grow according to the logistic law dp / dt = ap — bp”, then a=2.309 and 
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OD 


om fa) 


<s p(t) 


to 


Figure |. Graph of p(t) 


b =2.309 /375. Consequently, the logistic law predicts that 


(2.309)5 
PU) = 7 309)5 (2.309)5 
A +.(2.309- a |p - 2.300% 
375 375 


___375 
1+ 74e~ 230%” 


(4) 


(We have taken the initial time ¢) to be 0.) Figure 2 compares the graph of 
p(t) predicted by Equation (4) with the actual measurements, which are de- 
noted by o. As can be seen, the agreement is remarkably good. 

In order to apply our results to predict the future human population of 
the earth, we must estimate the vital coefficients a and b in the logistic 
equation governing its growth. Some ecologists have estimated that the na- 
tural value of a is 0.029. We also know that the human population was in- 
creasing at the rate of 2% per year when the population was (3.34)10’. 
Since (1/p)(dp / dt) =a -— bp, we see that 

0.02 = a— b (3.34)10?. 
Consequently, b=2.695X10~'?. Thus, according to the logistic law of 


population growth, the human population of the earth will tend to the 


limiting value of 
a 0.029 
— = ———————_ = 10.76 billion people 
b 2695x1072 ee 


Note that according to this prediction, we were still on the accelerated 
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Figure 2. The growth of paramecium 


growth portion of the logistic curve in 1965, since we had not yet attained 
half the limiting population predicted for us. 


Remark. A student of mine once suggested that we use Equation (3) to find 
the time ¢ at which p(t) = 2, and then we can deduce how long ago mankind 
appeared on earth. On the surface this seems like a fantastic idea. However, 
we cannot travel that far backwards into the past, since our model is no 
longer accurate when the population is small. 


As another verification of the validity of the logistic law of population 
growth, we consider the equation 


_ 197,273,000 
P(t)= | + @ ~ 0.03134(7 — 1913.25) (5) 


which was introduced by Pearl and Reed as a model of the population 
growth of the United States. First, using the census figures for the years 
1790, 1850, and 1910, Pearl and Reed found from (3) (see Exercise 2a) that 
a=0.03134 and b=(1.5887)107 '°. Then (see Exercise 2b), Pearl and Reed 
calculated that the population of the United States reached half its limiting 
population of a/b = 197,273,000 in April 1913. Consequently (see Exercise 
2c), we can rewrite (3) in the simpler form (5). 

Table 2 below compares Pearl and Reed’s predictions with the observed 
values of the population of the United States. These results are remarkable, 
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Table 2. Population of the U.S. from 1790-1950. (The last 4 entries 
were added by the Dartmouth College Writing Group.) 


es 
0 [spasm [apm [0 
| 28.000 | Os 
| 02 

] 

] 

2 


a) 
2 
9 
6 


on 
[1840 [1,069,000 | 17506000. [aero | 26 


Oe 
io [ sera [sisremo [000 
123,124,000 


1930 122,775,000 349,000 
1940 131,669,000 136,653,000 4,984,000 
1950 150,697,000 149,053,000 — 1,644,000 pod 


especially since we have not taken into account the large waves of im- 
migration into the United States, and the fact that the United States was 
involved in five wars during this period. 

In 1845 Verhulst prophesied a maximum population for Belgium of 
6,600,000, and a maximum population for France of 40,000,000. Now, the 
population of Belgium in 1930 was already 8,092,000. This large dis- 
crepancy would seem to indicate that the logistic law of population growth 
is very inaccurate, at least as far as the population of Belgium is con- 
cerned. However, this discrepancy can be explained by the astonishing rise 
of industry in Belgium, and by the acquisition of the Congo which secured 
for the country sufficient additional wealth to support the extra popula- 
tion. Thus, after the acquisition of the Congo, and the astonishing rise of 
industry in Belgium, Verhulst should have lowered the vital coefficient b. 

On the other hand, the population of France in 1930 was in remarkable 
agreement with Verhulst’s forecast. Indeed, we can now answer the follow- 
ing tantalizing paradox: Why was the population of France increasing 
extremely slowly in 1930 while the French population of Canada was in- 
creasing very rapidly? After all, they are the same people! The answer to 
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this paradox, of course, is that the population of France in 1930 was very 
near its limiting value and thus was far into the period of diminishing 
growth, while the population of Canada in 1930 was still in the period of 
accelerated growth. 


Remark 1. It is clear that technological developments, pollution considera- 
tions and sociological trends have significant influence on the vital coef- 
ficients a and b. Therefore, they must be re-evaluated every few years. 


Remark 2. To derive more accurate models of population growth, we 
should not consider the population as made up of one homogeneous group 
of individuals. Rather, we should subdivide it into different age groups. We 
should also subdivide the population into males and females, since the re- 
production rate in a population usually depends more on the number of 
females than on the number of males. 


Remark 3. Perhaps the severest criticism leveled at the logistic law of 
population growth is that some populations have been observed to 
fluctuate periodically between two values, and any type of fluctuation is 
ruled out in a logistic curve. However, some of these fluctuations can be 
explained by the fact that when certain populations reach a sufficiently 
high density, they become susceptible to epidemics. The epidemic brings 
the population down to a lower value where it again begins to increase, 
until when it is large enough, the epidemic strikes again. In Exercise 10 we 
derive a model to describe this phenomenon, and we apply this model in 
Exercise 11 to explain the sudden appearance and disappearance of hordes 
of small rodents. 


Epilog. The following article appeared in the New York Times on March 
26, 1978, and was authored by Nick Eberstadt. 

The gist of the following article is that it is very difficult, using statistical 
methods alone, to make accurate population projections even 30 years into 
the future. In 1970, demographers at the United Nations projected the 
population of the earth to be 6.5 billion people by the year 2000. Only six 
years later, this forecast was revised downward to 5.9 billion people. 

Let us now use Equation (3) to predict the population of the earth in the 
year 2000. Setting a =.029, b = 2.695 X 107”, p) = 3.34. X 10°, t, = 1965, and 
t = 2,000 gives 

(.029)(3.34)10? 

009 + (.02)e7 69) 

29(3.34) 109 
9+20e7 !0! 
= 5.96 billion people! 


p(2000) = 


This is another spectacular application of the logistic equation. 
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1.5 Population models 
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EXERCISES 
1. Prove that (a— bpo)/(a— bp(t)) is positive for tg < t < 00. Hint: Use Equation (2) 


to show that p(f) can never equal a/b if pp#a/b. 


. (a) Choose 3 times fo, ¢,, and fy, with ¢, — tp =1f,—1¢,. Show that (3) determines a 


and b uniquely in terms of fo, p(to), ¢), P(t)), t, and p(t). 

(b) Show that the period of accelerated growth for the United States ended in 
April, 1913. ; 

(c) Let a population p(1) grow according to the logistic law (3), and let ¢ be the 
time at which half the limiting population is achieved. Show that 


a/b 


p(t)= ———___.. 
bee-a—) 


- In 1879 and 1881 a number of yearling bass were seined in New Jersey, taken 


across the continent in tanks by train, and planted in San Francisco Bay. A total 
of only 435 Striped Bass survived the rigors of these two trips. Yet, in 1899, the 
commercial net catch alone was 1,234,000 pounds. Since the growth of this 
population was so fast, it is reasonable to assume that it obeyed the Malthusian 
law dp / dt = ap. Assuming that the average weight of a bass fish is three pounds, 
and that in 1899 every tenth bass fish was caught, find a lower bound for a. 


. Suppose that a population doubles its original size in 100 years, and triples it in 


200 years. Show that this population cannot satisfy the Malthusian law of 
population growth. 


. Assume that p(t) satisfies the Malthusian law of population growth. Show that 


the increases in p in successive time intervals of equal duration form the terms of 
a geometric progression. This is the source of Thomas Malthus’ famous dictum 
“Population when unchecked increases in a geometrical ratio. Subsistence 
increases only in an arithmetic ratio. A slight acquaintance with numbers will 
show the immensity of the first power in comparison of the second.” 


. A population grows according to the logistic law, with a limiting population of 


5 x 10® individuals. When the population is low it doubles every 40 minutes. 
What will the population be after two hours if initially it is (a) 10°, (b) 10°? 


. A family of salmon fish living off the Alaskan Coast obeys the Malthusian law 


of population growth dp(t)/dt=0.003p(t), where ¢ is measured in minutes. At 
time t=0 a group of sharks establishes residence in these waters and begins 
attacking the salmon. The rate at which salmon are killed by the sharks is 
0.001p?(t), where p(t) is the population of salmon at time ¢. Moreover, since an 
undesirable element has moved into their neighborhood, 0.002 salmon per 
minute leave the Alaskan waters. 

(a) Modify the Malthusian law of population growth to take these two factors 

into account. 
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(b) Assume that at time t=0 there are one million salmon. Find the population 
p(t). What happens as too? 

(c) Show that the above model is really absurd. Hint: Show, according to this 
model, that the salmon population decreases from one million to about one 
thousand in one minute. 


8. The population of New York City would satisfy the logistic law 


dp | l 


fae pe 2 
dt 25? (25)10°” ’ 


where ¢ is measured in years, if we neglected the high emigration and homicide 

rates. 

(a) Modify this equation to take into account the fact that 9,000 people per year 
move from the city, and 1,000 people per year are murdered. 

(b) Assume that the population of New York City was 8,000,000 in 1970. Find 
the population for all future time. What happens as t-00? 


9. An initial population of 50,000 inhabits a microcosm with a carrying capacity of 


10. 


100,000. After five years, the population has increased to 60,000. Show that the 
natural growth rate a for this population is (1 /5)ln3/2. 


We can model a population which becomes susceptible to epidemics in the 
following manner. Assume that our population is originally governed by the lo- 
gistic law 


dp 5 
wee (i) 


and that an epidemic strikes as soon as p reaches a certain value Q, with Q less 
than the limiting population a/b. At this stage the vital coefficients become 
A<a, B<b, and Equation (i) is replaced by 

e = Ap — Bp?. (ii) 
Suppose that Q > A/B. The population then starts decreasing. A point is 
reached when the population falls below a certain value q>A/B. At this mo- 
ment the epidemic ceases and the population again begins to grow following 
Equation (i), until the incidence of a fresh epidemic. In this way there are peri- 
odic fluctuations of p between g and Q. We now indicate how to calculate the 
period 7 of these fluctuations. 


(a) Show that the time 7, taken by the first part of the cycle, when p increases 
from q to Q is given by 
1, Q (4-64) 
T, = —ln ———_-.. 
a q(a—bQ) 
(b) Show that the time T, taken by the second part of the cycle, when p de- 
creases from Q to q 1s given by 
li q(QB-— A) 
A Q(qB-A)- 
Thus, the time for the entire cycle is T, + 7). 


T, 
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12. 


1.6 The spread of technological innovations 


It has been observed that plagues appear in mice populations whenever the 
population becomes too large. Further, a local increase of density attracts preda- 
tors in large numbers. These two factors will succeed in destroying 97-98% of a 
population of small rodents in two or three weeks, and the density then falls to a 
level at which the disease cannot spread. The population, reduced to 2% of its 
maximum, finds its refuges from the predators sufficient, and its food abundant. 
The population therefore begins to grow again until it reaches a level favorable 
to another wave of disease and predation. Now, the speed of reproduction in 
mice is so great that we may set b=0 in Equation (i) of Exercise 7. In the second 
part of the cycle, on the contrary, A is very small in comparison with B, and it 
may be neglected therefore in Equaticn (ii). 

(a) Under these assumptions, show that 


(b) Assuming that 7, is approximately four years, and Q/q is approximately 
fifty, show that a is approximately one. This value of a, incidentally, corre- 
sponds very well with the rate of multiplication of mice in natural circums- 
tances. 


There are many important classes of organisms whose birth rate is not propor- 
tional to the population size. Suppose, for example, that each member of the 
population requires a partner for reproduction, and that each member relies on 
chance encounters for meeting a mate. If the expected number of encounters is 
proportional to the product of the numbers of males and females, and if these 
are equally distributed in the population, then the number of encounters, and 
hence the birthrate too, is proportional to p?. The death rate is still proportional 
to p. Consequently, the population size p(t) satisfies the differential equation 


dp 
_ = hp? — 

Th bp“—ap, a,b>0. 

Show that p(t) approaches 0 as t->00 if py< a/b. Thus, once the population size 
drops below the critical size a/b, the population tends to extinction. Thus, a 
species is classified endangered if its current size is perilously close to its critical 
Size. 


1.6 The spread of technological innovations 


Economists and sociologists have long been concerned with how a techno- 
logical change, or innovation, spreads in an industry. Once an innovation 
is introduced by one firm, how soon do others in the industry come to 
adopt it, and what factors determine how rapidly they follow? In this sec- 
tion we construct a model of the spread of innovations among farmers, 
and then show that this same model also describes the spread of innova- 
tions in such diverse industries as bituminous coal, iron and steel, brewing, 
and railroads. 


Assume that a new innovation is introduced into a fixed community of 


N farmers at time t=0. Let p(t) denote the number of farmers who have 
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adopted at time ¢. As in the previous section, we make the approximation 
that p(t) is a continuous function of time, even though it obviously changes 
by integer amounts. The simplest realistic assumption that we can make 
concerning the spread of this innovation is that a farmer adopts the in- 
novation only after he has been told of it by a farmer who has already 
adopted. Then, the number of farmers Ap who adopt the innovation in a 
small time interval At is directly proportional to the number of farmers p 
who have already adopted, and the number of farmers N—p who are as 
yet unaware. Hence, Ap = cp(N — p)At or Ap /At=cp(N — p) for some posi- 
tive constant c. Letting At—0, we obtain the differential equation 


d 
— = p(N~p). (1) 


This is the logistic equation of the previous section if we set a=cN, b=c. 
Assuming that p(0)= 1; 1.e., one farmer has adopted the innovation at time 
t=(, we see that p(Z) satisfies the initial-value problem 


dp 
= =cp(N-p), — p(0)=1. (2) 
The solution of (2) is 


Nes™ 
N-1+e™ 


which is a logistic function (see Section 1.5). Hence, our model predicts 
that the adoption process accelerates up to that point at which half the 
community is aware of the innovation. After this point, the adoption pro- 
cess begins to decelerate until it eventually reaches zero. 

Let us compare the predictions of our model with data on the spread of 
two innovations through American farming communities in the middle 
1950’s. Figure 1 represents the cumulative number of farmers in Iowa 
during 1944-1955 who adopted 2,4-D weed spray, and Figure 2 represents 
the cumulative percentage of corn acreage in hybrid corn in three Ameri- 
can states during the years 1934-1958. The circles in these figures are the 
actual measurements, and the graphs were obtained by connecting these 
measurements with straight lines. As can be seen, these curves have all the 
properties of logistic curves, and on the whole, offer very good agreement 
with our model. However, there are two discrepancies. First, the actual 
point at which the adoption process ceases to accelerate is not always when 
fifty per cent of the population has adopted the innovation. As can be seen 
from Figure 2, the adoption process for hybrid corn began to decelerate in 
Alabama only after nearly sixty per cent of the farmers had adopted the 
innovation. Second, the agreement with our model is much better in the 
later stages of the adoption process than in the earlier stages. 

The source of the second discrepancy is our assumption that a farmer 
only learns of an innovation through contact with another farmer. This is 
not entirely true. Studies have shown that mass communication media such 


p(= (3) 
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Figure 1. Cumulative number of farmers who adopted 2,4-D weed spray in Iowa 
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Figure 2. Cumulative percentage of corn acreage in hybrid corn in three American 
States 
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as radio, television, newspapers and farmers’ magazines play a large role in 
the early stages of the adoption process. Therefore, we must add a term to 
the differential equation (1) to take this into account. To compute this 
term, we assume that the number of farmers Ap who learn of the innova- 
tion through the mass communication media in a short period of time At is 
proportional to the number of farmers who do not yet know; .e., 


Ap=c'(N—p)At 


for some positive constant c’. Letting At-0, we see that c’(N — p) farmers, 
per unit time, learn of the innovation through the mass communication 
media. Thus, if p(0)=0, then p(t) satisfies the initial-value problem 


& =cp(N-p)+c'(N—p),  p(0)=0. (4) 


The solution of (4) is 
Ne'[ ele ten —] ] 
cN+ cele ten yt” (5) 


and in Exercises 2 and 3 we indicate how to determine the shape of the 
curve (5). 

The corrected curve (5) now gives remarkably good agreement with Fig- 
ures | and 2, for suitable choices of c and c’. However, (see Exercise 3c) it 
still doesn’t explain why the adoption of hybrid corn in Alabama only be- 
gan to decelerate after sixty per cent of the farmers had adopted the in- 
novation. This indicates, of course, that other factors, such as the time in- 
terval that elapses between when a farmer first learns of an innovation and 
when he actually adopts it, may play an important role in the adoption 
process, and must be taken into account in any model. 

We would now like to show that the differential equation 


dp / dt = cp(N — p) 

also governs the rate at which firms in such diverse industries as bi- 
tuminous coal, iron and steel, brewing, and railroads adopted several 
major innovations in the first part of this century. This is rather surprising, 
since we would expect that the number of firms adopting an innovation in 
one of these industries certainly depends on the profitability of the innova- 
tion and the investment required to implement it, and we haven’t men- 
tioned these factors in deriving Equation (1). However, as we shall see 
shortly, these two factors are incorporated in the constant c. 

Let n be the total number of firms in a particular industry who have 
adopted an innovation at time ¢. It is clear that the number of firms Ap 
who adopt the innovation in a short time interval At is proportional to the 
number of firms n — p who have not yet adopted; i.e., Ap =A(n — p)At. Let- 
ting At—0, we see that 


p(t)= 
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The proportionality factor A depends on the profitability 7 of installing this 
innovation relative to that of alternative investments, the investment s re- 
quired to install this innovation as a percentage of the total assets of the 
firm, and the percentage of firms who have already adopted. Thus, 


A=f(7,s,p/n). 


Expanding f in a Taylor series, and dropping terms of degree more than 
two, gives 


A=a,+a,7+a35+ a= + asm? + ags* + azms 


Pp p p\’ 
+ —)+ays| —)+a,| —}. 
agn( =) +a5s() + a.0( =) 


In the late 1950’s, Edwin Mansfield of Carnegie Mellon University in- 
vestigated the spread of twelve innovations in four major industries. From 
his exhaustive studies, Mansfield concluded that a,)=0 and 
a, ta,7+a,s+ asm? + as? + ajns =0. 
Thus, setting 
k=a,t+a,7+ ays, (6) 
we see that F 
iP P 
— =k—(n-p). 
aK (n-P) 
(This is the equation obtained previously for the spread of innovations 
among farmers, if we set k/n=c.) We assume that the innovation is first 
adopted by one firm in the year f). Then, p(¢) satisfies the initial-value 
problem 


dp sk 
Wo Pine) P(t) =1 (7) 


and this implies that 
n 


1+(n—l)eWke- 0) 


Mansfield studied how rapidly the use of twelve innovations spread 
from enterprise to enterprise in four major industries—bituminous coal, 
iron and steel, brewing, and railroads. The innovations are the shuttle car, 
trackless mobile loader, and continuous mining machine (in bituminous 
coal); the by-product coke oven, continuous wide strip mill, and continu- 
ous annealing line for tin plate (in iron and steel); the pallet-loading 
machine, tin container, and high speed bottle fillez (an brewing); and the 
diesel locomotive, centralized traffic control, and car retarders (in rail- 
roads). His results are described graphically in Figure 3. For all but the 
by-product coke oven and tin container, the percentages given are for ev- 
ery two years from the year of initial introduction. The length of the inter- 
val for the by-product coke oven is about six years, and for the tin con- 
tainer, it is six months. Notice how all these curves have the general ap- 
pearance of a logistic curve. 


p(t)= 
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Figure 3. Growth in the percentage of major firms that introduced twelve in- 
novations; bituminous coal, iron and steel, brewing, and railroad industries, 
1890-1958; (a) By-product coke oven (CQ), diesel locomotive (DL), tin container 
(TC), and shuttle car (SC); (b) Car retarder (CR), trackless mobile loader (ML), 
continuous mining machine (CM), and pallet-loading machine (PL); (c) Continu- 
ous wide-strip mill (SM), centralized traffic control (CTC), continuous annealing 
(CA), and highspeed bottle filler (BF). 


1.6 The spread of technological innovations 


Table 1. 


Innovation 


Diesel locomotive 

Centralized traffic 
control 

Car retarders 


Continuous wide 

strip mill 
By-product coke 

oven 
Continuous annealing 


Shuttle car 

Trackless mobile 
loader 

Continuous mining 
machine 


Tin container 

High speed bottle 
filler 

Pallet-loading 
machine 


For a more detailed comparison of the predictions of our model (7) with 
these observed results, we must evaluate the constants n, k, and ty for each 
of the twelve innovations. Table | gives the value of n, fo, a4, ds, Ag, 7, and 
s for each of the twelve innovations; the constant k can then be computed 
from Equation (6). As the answers to Exercises 5 and 6 will indicate, our 
model (7) predicts with reasonable accuracy the rate of adoption of these 
twelve innovations. 


Reference 


Mansfield, E., “Technical change and the rate of imitation,” Econometrica, Vol. 29, 
No. 4, Oct. 1961. 


EXERCISES 
1. Solve the initial-value problem (2). 


2. Let c=0 in (5). Show that p(s) increases monotonically from 0 to N, and has no 
points of inflection. 


3. Here is a heuristic argument to determine the behavior of the curve (5). If c’ =0, 
then we have a logistic curve, and if c=0, then we have the behavior described 
in Exercise 2. Thus, if c is large relative to c’, then we have a logistic curve, and 
if c is small relative to c’ then we have the behavior illustrated in Exercise 2. 
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(a) Let p(Z) satisfy (4). Show that 


d*p 
72 =(N-—p)(cp+c’)(cN —2cp—c’). 
(b) Show that p(t) has a point of inflection, at which dp/dt achieves a maxi- 
mum, if, and only if, c’/c<N. 


(c) Assume that p(t) has a point of inflection at r=7¢*. Show that p(t*) < N/2. 
4. Solve the initial-value problem (7). 


5. It seems reasonable to take the time span between the date when 20% of the 
firms had introduced the innovation and the date when 80% of the firms had in- 
troduced the innovation, as the rate of imitation. 

(a) Show from our model that this time span is 4(1n2)/k. 
(b) For each of the twelve innovations, compute this time span from the data in 
Table 1, and compare with the observed value in Figure 3. 


6. (a) Show from our model that (1/k)In(m— 1) years elapse before 50% of the 
firms introduce an innovation. 


(b) Compute this time span for each of the 12 innovations and compare with the 
observed values in Figure 3. 


1.7 An atomic waste disposal problem 


For several years the Atomic Energy Commission (now known as the 
Nuclear Regulatory Commission) had disposed of concentrated radioac- 
tive waste material by placing it in tightly sealed drums which were then 
dumped at sea in fifty fathoms (300 feet) of water. When concerned ecolo- 
gists and scientists questioned this practice, they were assured by the 
A.E.C. that the drums would never develop leaks. Exhaustive tests on the 
drums proved the A.E.C. right. However, several engineers then raised the 
question of whether the drums could crack from the impact of hitting the 
ocean floor. “Never,” said the A.E.C. “We'll see about that,” said the en- 
gineers. After performing numerous experiments, the engineers found that 
the drums could crack on impact if their velocity exceeded forty feet per 
second. The problem before us, therefore, is to compute the velocity of the 
drums upon impact with the ocean floor. To this end, we digress briefly to 
study elementary Newtonian mechanics. 

Newtonian mechanics is the study of Newton’s famous laws of motion 
and their consequences. Newton’s first law of motion states that an object 
will remain at rest, or move with constant velocity, if no force is acting on 
it. A force should be thought of as a push or pull. This push or pull can be 
exerted directly by something in contact with the object, or it can be ex- 
erted indirectly, as the earth’s pull of gravity is. 

Newton’s second law of motion is concerned with describing the motion 
of an object which is acted upon by several forces. Let y(t) denote the 
position of the center of gravity of the object. (We assume that the object 
moves in only one direction.) Those forces acting on the object, which tend 
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to increase y, are considered positive, while those forces tending to de- 
crease y are considered negative. The resultant force F acting on an object 
is defined to be the sum of all positive forces minus the sum of all negative 
forces. Newton’s second law of motion states that the acceleration d’y / dt? 
of an object is proportional to the resultant force F acting on it; Le., 
2 

av _ip (1) 

dt? om 
The constant m is the mass of the object. It is related to the weight W of 
the object by the relation W=m_g, where g is the acceleration of gravity. 
Unless otherwise stated, we assume that the weight of an object and the ac- 
celeration of gravity are constant. We will also adopt the English system of 
units, so that ¢ is measured in seconds, y is measured in feet, and F is 
measured in pounds. The units of m are then slugs, and the gravitational 
acceleration g equals 32.2 ft/s’. 


Remark. We would prefer to use the mks system of units, where y is 
measured in meters and F is measured in newtons. The units of m are then 
kilograms, and the gravitational acceleration equals 9.8 m/s’. In the third 
edition of this text, we have changed from the English system of units to the 
mks system in Section 2.6. However, changing to the mks system in this 
section would have caused undue confusion to the users of the first and 
second editions. This is because of the truncation error involved in convert- 
ing from feet to meters and pounds to newtons. 


We return now to our atomic waste disposal problem. As a drum de- 
scends through the water, it is acted upon by three forces W, B, and D. 
The force W is the weight of the drum pulling it down, and in magnitude, 
W = 527.436 lb. The force B 1s the buoyancy force of the water acting on 
the drum. This force pushes the drum up, and its magnitude is the weight 
of the water displaced by the drum. Now, the Atomic Energy Commission 
used 55 gallon drums, whose volume is 7.35 ft?. The weight of one cubic 
foot of salt water is 63.99 lb. Hence B=(63.99)(7.35) = 470.327 Ib. 

The force D is the drag force of the water acting on the drum; it resists 
the motion of the drum through the water. Experiments have shown that 
any medium such as water, oil, and air resists the motion of an object 
through it. This resisting force acts in the direction opposite the motion, 
and is usually directly proportional to the velocity V of the object. Thus, 
D=cV, for some positive constant c. Notice that the drag force increases 
as V increases, and decreases as V decreases. To calculate D, the engineers 
conducted numerous towing experiments. They concluded that the orienta- 
tion of the drum had little effect on the drag force, and that 


(Ib)(s) 


D=0.08V , 
ft 
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Now, set y =0 at sea level, and let the direction of increasing y be down- 
wards. Then, W is a positive force, and B and D are negative forces. Con- 
sequently, from (1), 

d*y g 

— =—(W-B-cV )=—,(W-B-cV). 

ae og (WB eV) = Fo cV) 
We can rewrite this equation as a first-order linear differential equation for 
V=dy/dt; ie., 


a” 4 Sy= 5 (w-s). (2) 


Initially, when the drum is released in the ocean, its velocity is zero. Thus, 
V(t), the velocity of the drum, satisfies the initial-value problem 


Set pla wpW-B) — V(0)=0, (3) 
and this implies that 
W-—B ~¢ 
V (t)=—— [1 -e' B/W )t) (4) 


Equation (4) expresses the velocity of the drum as a function of time. 
In order to determine the impact velocity of the drum, we must compute 
the time ¢ at which the drum hits the ocean floor. Unfortunately, though, it 
is impossible to find ¢ as an explicit function of y (see Exercise 2). There- 
fore, we cannot use Equation (4) to find the velocity of the drum when it 
hits the ocean floor. However, the A.E.C. can use this equation to try and 
prove that the drums do not crack on impact. To wit, observe from (4) that 
V(t) is a monotonic increasing function of time which approaches the 
limiting value WR 

V>= . 
as ¢ approaches infinity. The quantity V- is called the terminal velocity of 
the drum. Clearly, V (7) < V;, so that the velocity of the drum when it hits 
the ocean floor is certainly less than (W—B)/c. Now, if this terminal 
velocity is less than 40 ft/s, then the drums could not possibly break on 
impact. However, 


WB _ 527.436—470.327 
c 0.08 


and this is way too large. 

It should be clear now that the only way we can resolve the dispute be- 
tween the A.E.C. and the engineers is to find v(y), the velocity of the drum 
as a function of position. The function v(y) is very different from the func- 
tion V(t), which is the velocity of the drum as a function of time. How- 
ever, these two functions are related through the equation 


V(t)=o(y(2)) 


= 713.86 ft/s, 
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if we express y as a function of ¢. By the chain rule of differentiation, 

dV / dt =(dv / dy)(dy / dt). Hence 
W dv dy 
g dy di =W-—B-cV. 

But dy /dt= V (t)= v(y(d)). Thus, suppressing the dependence of y on ft, we 

see that v(y) satisfies the first-order differential equation 


W adv v dv _ & 
go dy BO OT W—B—cv dy 7 a 
Moreover, 
v(0) = v(y(0))=V 0) =0. 
Hence, 
f —rdr * 8 ga Fe 
W-B-cr — o W W 
Now, 
[ rdr r—(W B)/c | wae dr 
0 W—B-cer W-—B-cr 0 W-—B-cr 
1 fe W-—B (° dr 
—_ [are c 9 W-B-er 
p (W-B) |W-B-co| 
ns ae, 


We know already that v<(W— B)/c. Consequently, W— B— cv is always 
positive, and 

by _ vp _ (WB) W-—B-co (5) 

W ¢ C2 1 WB 

At this point, we are ready to scream in despair since we cannot find v 

as an explicit function of y from (5). This is not an insurmountable diffi- 

culty, though. As we show in Section 1.11, it is quite simple, with the aid of 

a digital computer, to find v(300) from (5). We need only supply the com- 

puter with a good approximation of v(300) and this is obtained in the 

following manner. The velocity v(y) of the drum satisfies the initial-value 


problem 
Tog =W- B—co, v(0)=0. (6) 


Let us, for the moment, set c=0 in (6) to obtain the new initial-value prob- 


lem Wd 
u 
—u—=W-B, u(0O)=0. 6’ 
ae (0) (6) 
(We have replaced v by u to avoid confusion later.) We can integrate (6’) 
immediately to obtain that 
W u? 2g ve 
Ty W-B)y, oF uv=| FW B)y] 
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In particular, 


2(32.2)(57.109)(300) 1/2 


2g 1/2 
u(300)=| (WW — B)300 | -| 357 436 


=V2092 =45.7 ft/s. 


We claim, now, that u(300) is a very good approximation of v(300). The 
proof of this is as follows. First, observe that the velocity of the drum is 
always greater if there is no drag force opposing the motion. Hence, 


»(300) < u(300). 


Second, the velocity v increases as y increases, so that v(y) < v(300) for y < 
300. Consequently, the drag force D of the water acting on the drum is al- 
ways less than 0.08 x u(300) =3.7 Ib. Now, the resultant force W— B pull- 
ing the drum down is approximately 57.1 Ib, which is very large compared 
to D. It stands to reason, therefore, that u(y) should be a very good ap- 
proximation of u(y). And indeed, this is the case, since we find numeri- 
cally (see Section 1.11) that »(300)=45.1 ft/s. Thus, the drums can break 
upon impact, and the engineers were right. 

Epilog. The rules of the Atomic Energy Commission now expressly for- 
bid the dumping of low level atomic waste at sea. This author is uncertain 
though, as to whether Western Europe has also forbidden this practice. 


Remark. The methods introduced in this section can also be used to find 
the velocity of any object which is moving through a medium that resists 
the motion. We just disregard the buoyancy force if the medium is not 
water. For example, let V(t) denote the velocity of a parachutist falling to 
earth under the influence of gravity. Then, 


where W is the weight of the man and the parachute, and D is the drag 
force exerted by the atmosphere on the falling parachutist. The drag force 
on a bluff object in air, or in any fluid of small viscosity is usually very 
nearly proportional to V. Proportionality to V is the exceptional case, and 
occurs only at very low speeds. The criterion as to whether the square or 
the linear law applies is the “Reynolds number” 


R=pVL/p. 


L is a representative length dimension of the object, and p and p are the 
density and viscosity of the fluid. If R<10, then D~V, and if R >10°, 
D~V*. For 10< R< 10, neither law is accurate. 
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EXERCISES 


1. 
2. 


10. 


11. 


Solve the initial-value problem (3). 


Solve for y= y(t) from (4), and then show that the equation y = y(t) cannot be 
solved explicitly for t= ¢()). 


. Show that the drums of atomic waste will not crack upon impact if they are 


dropped into L feet of water with (2g(W— B)L/W)'/* < 40. 


. Fat Richie, an enormous underworld hoodlum weighing 400 Ib, was pushed out 


of a penthouse window 2800 feet above the ground in New York City. Neglect- 
ing air resistance find (a) the velocity with which Fat Richie hit the ground; (b) 
the time elapsed before Fat Richie hit the ground. 


. An object weighing 300 Ib is dropped into a river 150 feet deep. The volume of 


the object is 2 ft?, and the drag force exerted by the water on it is 0.05 times its 
velocity. The drag force may be considered negligible if it does not exceed 5% 
of the resultant force pulling the drum down. Prove that the drag force is negli- 
gible in this case. (Here B =2(62.4) = 124.8.) 


. A 400 lb sphere of volume 47/3 and a 300 Ib cylinder of volume 7 are simulta- 


neously released from rest into a river. The drag force exerted by the water on 
the falling sphere and cylinder is AV, and AV,, respectively, where V, and V, 
are the velocities of the sphere and cylinder, and A is a positive constant. De- 
termine which object reaches the bottom of the river first. 


. A parachutist falls from rest toward earth. The combined weight of man and 


parachute is 161 lb. Before the parachute opens, the air resistance equals V//2. 
The parachute opens 5 seconds after the fall begins; and the air resistance is 
then V?/2. Find the velocity V(r) of the parachutist after the parachute opens. 


. A man wearing a parachute jumps from a great height. The combined weight 


of man and parachute is 161 lb. Let V(t) denote his speed at time ¢ seconds 
after the fall begins. During the first 10 seconds, the air resistance is V/2. 
Thereafter, while the parachute is open, the air resistance is 10V. Find an ex- 
plicit formula for V(t) at any time ¢ greater than 10 seconds. 


. An object of mass m is projected vertically downward with initial velocity V, in 


a medium offering resistance proportional to the square root of the magnitude 

of the velocity. 

(a) Find a relation between the velocity V and the time ¢ if the drag force 
equals cVV . 

(b) Find the terminal velocity of the object. Hint: You can find the terminal 
velocity even though you cannot solve for V (2). 


A body of mass m falls from rest in a medium offering resistance proportional 
to the square of the velocity; that is, D=cV*. Find V(t) and compute the 
terminal velocity V. 


A body of mass m is projected upward from the earth’s surface with an initial 
velocity Vj. Take the y-axis to be positive upward, with the origin on the 
surface of the earth. Assuming there is no air resistance, but taking into 
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account the variation of the earth’s gravitational field with altitude, we obtain 
that 
dV _____—mgR? 


mee on 
di (yt RY 
where R is the radius of the earth. 
(a) Let V(t)=v()(d)). Find a differential equation satisfied by v(y). 
(b) Find the smallest initial velocity Vo for which the body will not return to 
earth. This is the so-called escape velocity. Hint: The escape velocity is 
found by requiring that v(y) remain strictly positive. 


12, It is not really necessary to find v(y) explicitly in order to prove that v(300) 
exceeds 40 ft/s. Here is an alternate proof. Observe first that v(y) increases as 
y increases. This implies that y is a monotonic increasing function of v. There- 
fore, if y is less than 300 ft when v is 40 ft/s, then v must be greater than 40 
ft/s when y is 300 ft. Substitute »o =40 ft/s in Equation (5), and show that y is 
less than 300 ft. Conclude, therefore, that the drums can break upon impact. 


1.8 The dynamics of tumor growth, mixing problems 
and orthogonal trajectories 


In this section we present three very simple but extremely useful applica- 
tions of first-order equations. The first application concerns the growth of 
solid tumors; the second application is concerned with “mixing problems” 
or “compartment analysis”; and the third application shows how to find a 
family of curves which is orthogonal to a given family of curves. 


(a) The dynamics of tumor growth 


It has been observed experimentally, that “free living” dividing cells, such 
as bacteria cells, grow at a rate proportional to the volume of dividing cells 
at that moment. Let V(t) denote the volume of dividing cells at time ¢. 
Then, 


dV _ 
Th =\V (1) 
for some positive constant A. The solution of (1) is 
V (t)=V e- (2) 


where Vj is the volume of dividing cells at the initial time f,. Thus, free 
living dividing cells grow exponentially with time. One important con- 
sequence of (2) is that the volume of cells keeps doubling (see Exercise 1) 
every time interval of length In2/). 

On the other hand, solid tumors do not grow exponentially with time. 
As the tumor becomes larger, the doubling time of the total tumor volume 
continuously increases. Various researchers have shown that the data for 
many solid tumors is fitted remarkably well, over almost a 1000 fold in- 
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crease in tumor volume, by the equation 


V (1) = Voexp( (1 -exp(— ar))] (3) 


where exp(x)=e*, and 4 and a are positive constants. 

Equation (3) is usually referred to as a Gompertzian relation. It says 
that the tumor grows more and more slowly with the passage of time, and 
that it ultimately approaches the limiting volume V,e*/*. Medical re- 
searchers have long been concerned with explaining this deviation from 
simple exponential growth. A great deal of insight into this problem can be 
gained by finding a differential equation satisfied by V(t). Differentiating 
(3) gives 

dV 


a Vy Aexp(— at) exp( (1 — exp(— a(1)))) 


=)\e “Y. (4) 


Two conflicting theories have been advanced for the dynamics of tumor 
growth. They correspond to the two arrangements 


a = (he )V (4a) 
dV —at 
a =Ne-"'V) (4b) 


of the differential equation (4). According to the first theory, the retarding 
effect of tumor growth is due to an increase in the mean generation time of 
the cells, without a change in the proportion of reproducing cells. As time 
goes on, the reproducing cells mature, or age, and thus divide more slowly. 
This theory corresponds to the bracketing (a). 

The bracketing (b) suggests that the mean generation time of the divid- 
ing cells remains constant, and the retardation of growth is due to a loss in 
reproductive cells in the tumor. One possible explanation for this is that a 
necrotic region develops in the center of the tumor. This necrosis appears at 
a critical size for a particular type of tumor, and thereafter the necrotic 
“core” increases rapidly as the total tumor mass increases. According to 
this theory, a necrotic core develops because in many tumors the supply of 
blood, and thus of oxygen and nutrients, is almost completely confined to 
the surface of the tumor and a short distance beneath it. As the tumor 
grows, the supply of oxygen to the central core by diffusion becomes more 
and more difficult resulting in the formation of a necrotic core. 


(b) Mixing problems 


Many important problems in biology and engineering can be put into the 
following framework. A solution containing a fixed concentration of sub- 
stance x flows into a tank, or compartment, containing the substance x 
and possibly other substances, at a specified rate. The mixture is stirred 
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together very rapidly, and then leaves the tank, again at a specified rate. 
Find the concentration of substance x in the tank at any time ¢. 

Problems of this type fall under the general heading of “mixing prob- 
lems,” or compartment analysis. The following example illustrates how to 
solve these problems. 


Example 1. A tank contains S, |b of salt dissolved in 200 gallons of water. 
Starting at time ¢=0, water containing ; lb of salt per gallon enters the 
tank at the rate of 4 gal/min, and the well stirred solution leaves the tank 
at the same rate. Find the concentration of salt in the tank at any time ¢> 
0. 

Solution. Let S(t) denote the amount of salt in the tank at time ¢. Then, 
S’(t), which is the rate of change of salt in the tank at time ¢, must equal 
the rate at which salt enters the tank minus the rate at which it leaves the 
tank. Obviously, the rate at which salt enters the tank is 


+ lb/gal times 4 gal/min =2 Ib/min. 


After a moment’s reflection, it is also obvious that the rate at which salt 
leaves the tank is 


| S(t) 
4 gal/min times 700° 
Thus 
s(t) =2— 2 S(0)=S 
50° 
and this implies that 
S(t) = Sye~°! + 100(1 — e797"). (5) 
Hence, the concentration c(t) of salt in the tank is given by 
o(1)= SO = SY gory 11 — 002, (6) 


Remark. The first term on the right-hand side of (5) represents the por- 
tion of the original amount of salt remaining in the tank at time ¢. This 
term becomes smaller and smaller with the passage of time as the original 
solution is drained from the tank. The second term on the right-hand side 
of (5) represents the amount of salt in the tank at time ¢ due to the action 
of the flow process. Clearly, the amount of salt in the tank must ultimately 
approach the limiting value of 100 lb, and this is easily verified by letting ¢ 
approach oo in (5). 


(c) Orthogonal trajectories 


In many physical applications, it is often necessary to find the orthogonal 
trajectories of a given family of curves. (A curve which intersects each 
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member of a family of curves at right angles is called an orthogonal trajec- 
tory of the given family.) For example, a charged particle moving under 
the influence of a magnetic field always travels on a curve which is per- 
pendicular to each of the magnetic field lines. The problem of computing 
orthogonal trajectories of a family of curves can be solved in the following 
manner. Let the given family of curves be described by the relation 


F(x,y,c)=0. (7) 
Differentiating this equation yields 
/ / FY 
F.+ Fyy’=90, or a a (8) 


Next, we solve for c=c(x,y) from (7) and replace every c in (8) by this 
value c(x,y). Finally, since the slopes of curves which intersect orthogo- 
nally are negative reciprocals of each other, we see that the orthogonal 
trajectories of (7) are the solution curves of the equation 


yas. (9) 
Example 2. Find the orthogonal trajectories of the family of parabolas 
x=cy’. 


Solution. Differentiating the equation x= cy” gives 1=2cyy’. Since c= 
x /y?, we see that y’=y/2x. Thus, the orthogonal trajectories of the family 


Figure 1. The parabolas x = cy? and their orthogonal trajectories 
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of parabolas x = cy? are the solution curves of the equation 


2x 
(=e, 10 
y y (10) 


This equation is separable, and its solution is 


yr t+2x*=k?, (11) 


Thus, the family of ellipses (11) (see Figure 1) are the orthogonal trajecto- 


ries 


of the family of parabolas x = cy’. 


Reference 


Burton, Alan C., Rate of growth of solid tumors as a problem of diffusion, Growth, 
1966, vol. 30, pp. 157-176. 


EXERCISES 


1. 
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A given substance satisfies the exponential growth law (1). Show that the graph 
of InV versus ¢ is a straight line. 


. A substance x multiplies exponentially, and a given quantity of the substance 


doubles every 20 years. If we have 3 lb of substance x at the present time, how 
many lb will we have 7 years from now? 


. A substance x decays exponentially, and only half of the given quantity of x 


remains after 2 years. How long does it take for 5 Ib of x to decay to 1 lb? 


. The equation p’=ap*, a >1, is proposed as a model of the population growth 


of a certain species. Show that p(t)—>0o in finite time. Conclude, therefore, that 
this model is not accurate over a reasonable length of time. 


. A cancerous tumor satisfies the Gompertzian relation (3). Originally, when it 


contained 10* cells, the tumor was increasing at the rate of 20% per unit time. 
The numerical value of the retarding constant a is 0.02. What is the limiting 
number of cells in this tumor? 


. A tracer dose of radioactive iodine '7'I is injected into the blood stream at time 


t=Q. Assume that the original amount Q, of iodine is distributed evenly in the 
entire blood stream before any loss occurs. Let Q(t) denote the amount of 
iodine in the blood at time ¢>0. Part of the iodine leaves the blood and enters 
the urine at the rate k,Q. Another part of the iodine enters the thyroid gland at 
the rate k,Q. Find Q(‘). 


. Industrial waste is pumped into a tank containing 1000 gallons of water at the 


rate of 1 gal/min, and the well-stirred mixture leaves the tank at the same rate. 
(a) Find the concentration of waste in the tank at time ¢. (b) How long does it 
take for the concentration to reach 20%? 


. A tank contains 300 gallons of water and 100 gallons of pollutants. Fresh water 


is pumped into the tank at the rate of 2 gal/min, and the well-stirred mixture 
leaves at the same rate. How long does it take for the concentration of pollu- 
tants in the tank to decrease to 1/10 of its original value? 


10. 


11. 
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. Consider a tank containing, at time t=0, Qo lb of salt dissolved in 150 gallons 


of water. Assume that water containing + lb of salt per gallon is entering the 
tank at a rate of 3 gal/min, and that the well-stirred solution is leaving the tank 
at the same rate. Find an expression for the concentration of salt in the tank at 
time f. 


A room containing 1000 cubic feet of air is originally free of carbon monoxide. 
Beginning at time t=0, cigarette smoke containing 4 percent carbon monoxide 
is blown into the room at the rate of 0.1 ft?/min, and the well-circulated mix- 
ture leaves the room at the same rate. Find the time when the concentration of 
carbon monoxide in the room reaches 0.012 percent. (Extended exposure to 
this concentration of carbon monoxide is dangerous.) 


A 500 gallon tank originally contains 100 gallons of fresh water. Beginning at 
time t=0, water containing 50 percent pollutants flows into the tank at the rate 
of 2 gal/min, and the well-stirred mixture leaves at the rate of 1 gal/min. Find 
the concentration of pollutants in the tank at the moment it overflows. 


In Exercises 12-17, find the orthogonal trajectories of the given family of 


curves. 

12. y=cx? 13. y?-x*=c 

14. y=csinx 15. x7+y?=cx (see Exercise 13 of Sec- 
tion 1.4) 

16. y=ce* 17. y=e™ 


18. The presence of toxins in a certain medium destroys a strain of bacteria at a 


19. 


rate jointly proportional to the number of bacteria present and to the amount 
of toxin. Call the constant of proportionality a. If there were no toxins present, 
the bacteria would grow at a rate proportional to the amount present. Call this 
constant of proportionality b. Assume that the amount T of toxin is increasing 
at a constant rate c, that is, dT /dt=c, and that the production of toxins begins 
at time ¢=0. Let y(t) denote the number of living bacteria present at time t. 
(a) Find a first-order differential equation satisfied by y(t). 

(b) Solve this differential equation to obtain y(t). What happens to y(#) as ¢ ap- 

proaches oo? 


Many savings banks now advertise continuous compounding of interest. This 

means that the amount of money P(r) on deposit at time f¢, satisfies the dif- 

ferential equation dP (t)/dt=rP(t) where r is the annual interest rate and 1 is 

measured in years. Let Py denote the original principal. 

(a) Show that P(1)= Poe’. 

(b) Let r=0.0575, 0.065, 0.0675, and 0.075. Show that e” = 1.05919, 1.06716, 
1.06983, and 1.07788, respectively. Thus, the effective annual yield on inter- 
est rates of 53, 65, 63, and 71% should be 5.919, 6.716, 6.983, and 7.788%, 
respectively. Most banks, however, advertise effective annual yields of 6, 
6.81, 7.08, and 7.9%, respectively. The reason for this discrepancy is that 
banks calculate a daily rate of interest based on 360 days, and they pay in- 
terest for each day money is on deposit. For a year, one gets five extra 
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days. Thus, we must multiply the annual yields of 5.919, 6.716, 6.983, and 
7.788% by 365 /360, and then we obtain the advertised values. 

(c) It is interesting to note that the Old Colony Cooperative Bank in Rhode Is- 
land advertises an effective annual yield of 6.72% on an annual interest rate 
of 65% (the lower value), and an effective annual yield of 7.9% on an ann- 
ual interest rate of 74%. Thus they are inconsistent. 


1.9 Exact equations, and why we cannot solve 

very many differential equations 
When we began our study of differential equations, the only equation we 
could solve was dy /dt= g(t). We then enlarged our inventory to include 


all linear and separable equations. More generally, we can solve all dif- 
ferential equations which are, or can be put, in the form 


< 9(t,y)=0 (1) 


for some function ¢$(t,y). To wit, we can integrate both sides of (1) to ob- 


tain that 
o(t,y) =constant (2) 


and then solve for y as a function of ¢ from (2). 


Example 1. The equation 1 +cos(t+y)+cos(t+y)(dy /dt)=0 can be writ- 
ten in the form (d/dt)[t+sin(t+ y)]=0. Hence, 


o(t,y)=t+sin(tt+y)=c, and y=—t+arcsin(c— ¢). 


Example 2. The equation cos(t+y)+[1+cos(t+,y)]dy /dt=0 can be writ- 
ten in the form (d/dt)[ y + sin(t + y)]=0. Hence, 


o(t,y)=y+sin(t+y)=c. 


We must leave the solution in this form though, since we cannot solve for y 
explicitly as a function of time. 


Equation (1) is clearly the most general first-order differential equation 
that we can solve. Thus, it is important for us to be able to recognize when 
a differential equation can be put in this form. This is not as simple as one 
might expect. For example, it is certainly not obvious that the differential 
equation 


2+ y—sint + (3y?+ cosy +1) 2 =(0 


can be written in the form (d/dt)(y?+ t?+ 4 +siny +cost)=0. To find all 
those differential equations which can be written in the form (1), observe, 
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from the chain rule of partial differentiation, that 


3 
£ $(ty(t))= + rte a 


Hence, the differential equation M (t, . + N (t,y)(dy /dt)=0 can be written 
in the form (d/dt)$(t,y)=0 if and only if there exists a function ¢(t,y) 
such that M(t,y)=0¢/dt and N(t,y)=0¢/ dy. 

This now leads us to the following question. Given two functions 
M(t,y) and N(t,y), does there exist a function ¢(t,y) such that M(t,y) 
= 0¢/dt and N(t,y)=0¢/dy? Unfortunately, the answer to this question 
is almost always no as the following theorem shows. 


Theorem 1. Let M (t,y) and N(t,y) be continuous and have continuous par- 
tial derivatives with respect to t and y in the rectangle R consisting of those 
points (t,y) with ax t<b and c<y<d. There exists a function $(t,y) 
such that M (t,y)=0o/ dt and N(t,y)=0/ dy if, and only if, 

0M / oy = IN/dt 
in R, 
PRoor. Observe that M (t,y)=0¢/d¢ for some function ¢(t,y) if, and only 
if, 
o(Ly)= f M(Ly)dt + h(y) (3) 


where h(y) is an arbitrary function of y. Taking partial derivatives of both 
sides of (3) with respect to y, we obtain that 


a9 OM (ty) 
yy —y +h’ (y). 
Hence, d¢/ dy will be equal to N ”” if, and only if, 
— 
v(t) = [ee as ny) 
or 
nO a 
h'(y)=N(ty)- po (4) 


Now h’(y) is a function of y alone, while the vrehthand side of (4) appears 
to be a function of both ¢ and y. But a function of y alone cannot be equal 
to a function of both ¢ and y. Thus Equation (4) makes sense only if the 
right-hand side is a function of y alone, and this is the case if, and only if, 
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Hence, if dN/dt#+0M/ody, then there is no function ¢(t,y) such that M 
= 0/dt, N=0¢/dy. On the other hand, if IN/dt=0M/ dy then we can 


solve for 
h(y)= f NO) [ao tra 


Consequently, M=0¢/dt, and N=09/ dy with 


OM (t, 
o(ty)= fM(ty)dtt+ []N(tr)- f ale Oo (5) 


Definition. The differential equation 
dy 
M(ty)+N(ty)7=0 (6) 
is said to be exact if OM/dy =0N/ dt. 


The reason for this definition, of course, is that the left-hand side of (6) 
is the exact derivative of a known function of ¢ and y if OM/dy=0N/0dt. 


Remark 1. It is not essential, in the statement of Theorem 1, that 0M /dy 
= 0N/dt in a rectangle. It is sufficient if 0M /dy=0N/dt in any region R 
which contains no “holes”. That is to say, if C is any closed curve lying en- 
tirely in R, then its interior also lies entirely in R. 


Remark 2. The differential equation dy /dt=f(t,y) can always be written 
in the form M (t,y)+N(t,y)(dy /dt)=0 by setting M(t,y)=—f(t,y) and 
N(t,y)= 1. 


Remark 3. It is customary to say that the solution of an exact differential 
equation is given by ¢(¢,y)=constant. What we really mean is that the 
equation $(t,y)=c is to be solved for y as a function of ¢ and c. Unfor- 
tunately, most exact differential equations cannot be solved explicitly for y 
as a function of ¢t. While this may appear to be very disappointing, we wish 
to point out that it is quite simple, with the aid of a computer, to compute 
y(t) to any desired accuracy (see Section 1.11). 


In practice, we do not recommend memorizing Equation (5). Rather, we 
will follow one of three different methods to obtain ¢(¢,y). 
First Method: The. equation M (t,y)= 0@/ dt determines ¢$(t,y) up to an ar- 
bitrary function of y alone, that is, 


$(4,y)= J M (t,y)dt+h(y). 
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The function h(y) is then determined from the equation 


h(a) =N (ey) [a 


Second Method: If N(t,y)=0¢/ dy, then, of vecessity 


o(ty)= [IN (Ly) ay +k() 
where k(f) 1s an arbitrary function of ¢ alone. Since 


AD ON (t,y) 
M(t,y)= Or OL 


we see that k(t) is determined from the ee 


k(1)=M (ty) fo 


Note that the right-hand side of this scuation (eee Exercise 2) is a function 
of ¢ alone if OM/dy=0N/ dt. 

Third Method: The equations 0¢/0t=M(t,y) and d¢/dy= N(t,y) imply 
that 


dy + k(t) 


o(ty)= [M(ty)dith(y) and o(ty)= [N (Ey)atk(0). 


Usually, we can determine h(y) and k(t) just by inspection. 
Example 3. Find the general solution of the differential equation 


3y +e! + (31+ cosy) 2 =0. 


Solution. Here M(t,y)=3y+e' and N(t,y)=3t+cosy. This equation is 
exact since 0M /dy =3 and 0N/dt=3. Hence, there exists a function ¢(t,y) 
such that 


. do . do 
t — a 
(i) 3yt+e ry and (ii) 3¢+cosy ay 
We will find ¢(t,y) by each of the three methods outlined above. 


First Method: From (i), $(t,y)=e'+3ty+h(y). Differentiating this equa- 
tion with respect to y and using (ii) we obtain that 


h'(y)+3t=3t+ cosy. 


Thus, h(y)=siny and $(t,y)=e'+3t +siny. (Strictly speaking, h(y)= 
siny + constant. However, we already incorporate this constant of integra- 
tion into the solution when we write $(t,y)=c.) The general solution of the 
differential equation must be left in the form e'+3ty+siny=c since we 
cannot find y explicitly as a function of ¢ from this equation. 

Second Method: From (ii), $(t,y)=3t +siny + k(t). Differentiating this 
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expression with respect to ¢, and using (i) we obtain that 
3y+k'(t)=3y +e’. 


Thus, k(t)=e' and $(t,y)=3t +siny +e’. 
Third Method: From (i) and (ii) 


o(t.y)=e'+3tyt+h(y) and ¢(t,y)=3y+siny+k(t). 


Comparing these two expressions for the same function ¢(t,y) it is obvious 
that h(y)=siny and k(t)=e’. Hence 


o(t,y)=e' +3 +siny. 
Example 4. Find the solution of the initial-value problem 
d 
3? y +874 (P+87y + 12?) — = , y(2)=1. 


Solution. Here M(t,y)=3t?y +8ty? and N(t,y)=0 + 8t*y + 12y?. This 
equation is exact since 


OM _ 3,2 ON 322 
ay 3t°+16t and ry 3° + 16ty. 
Hence, there exists a function ¢$(?,y) such that 
6) 6) 
(i) 3?y+8p7?= -° and (ii) 2+8r7y+12y?= - 


Again, we will find $(t,y) by each of three methods. 


First Method: From (i), $(t,y)= Py + 4t’?y? + h(y). Differentiating this 
equation with respect to y and using (ii) we obtain that 


P+8ryth'(y)=0+807y + 12y?. 


Hence, h(y)=4y? and the general solution of the differential equation is 
o(t,y)= ty +4ty2+4y3=c. Setting t=2 and y=1 in this equation, we see 
that c=28. Thus, the solution of our initial-value problem is defined im- 
plicitly by the equation fry + 4fy?+4y3=28. 

Second Method: From (ii), $(t,y)= Py + 4t*y?+4y3+ k(t). Differentiating 
this expression with respect to ¢ and using (i) we obtain that 


3t7y +817 +k'(t)=3t-y + 8p. 
Thus k(t)=0 and $(t,y)= fy +477 y? + 4y?. 
Third Method: From (i) and (ii) 
o(ty)=Cyt+4t?y2+h(y) and 9(t,y)=ty+4t2y2+4y3+k(2). 


Comparing these two expressions for the same function ¢(t,y) we see that 
h(y)=4y? and k(t)=0. Hence, $(t,y)= ty +407 y?2+ 4y3. 

In most instances, as Examples 3 and 4 illustrate, the third method is 
the simplest to use. However, if it is much easier to integrate N with re- 
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spect to y than it is to integrate M with respect to ¢, we should use the sec- 
ond method, and vice-versa. 


Example 5. Find the solution of the initial-value problem 
Aire? + hel +214(He +2) 2 m0, y(0)=1. 
Solution. This equation is exact since 


gy ett + fe +21) = (4 +40 = < (tte? +2y), 


Hence, there exists a function ¢(¢,y) such that 


d 
(i) ate? + elt? -21— 
and 36 
.* 4,tt+y — _' 
(i) fe'*” +2y ay 


Since it is much simpler to integrate r4e’*” +2y with respect to y than it is 
to integrate 4r%e'*” + t4e't” +21 with respect to fr, we use the second 
method. From (ii), $(t,y)= tet” + y*+k(t). Differentiating this expres- 
sion with respect to ¢ and using (i) we obtain 


(14+ 42 jet? +k'(t)=40e't’ + rfe'*’ +21. 
Thus, k(t)=¢* and the general solution of the differential equation is 
o(t,y)=te't’+y2+ = c. Setting t=0 and y=1 in this equation yields c 
= 1. Thus, the solution of our initial-value problem is defined implicitly by 
the equation f4e’*’ + #7+ y?=1. 
Suppose now that we are given a differential equation 
dy 
M(ty)+N(ty)7 =0 (7) 


which is not exact. Can we make it exact? More precisely, can we find a 
function p(t,y) such that the equivalent differential equation 


m(ty)M (ty) + W(LY)N (1.9) =0 (8) 


is exact? This question is simple, in principle, to answer. The condition that 
(8) be exact is that 


ey | u(ty)M (t,y)) = 2 ( u(t») (4) 
Or 


ay ay 


ou OM _, oH aN 
ar or’ ) 


(For simplicity of writing, we have suppressed the dependence of p, M and 
N on ¢ and y in (9).) Thus, Equation (8) is exact if and only if w(t,y) satis- 
fies Equation (9). 
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Definition. A function p(t,y) satisfying Equation (9) is called an integrat- 
ing factor for the differential equation (7). 


The reason for this definition, of course, is that if » satisfies (9) then we 
can write (8) in the form (d/dt)d(t,y)=0 and this equation can be in- 
tegrated immediately to yield the solution ¢(t,y)=c. Unfortunately, 
though, there are only two special cases where we can find an explicit solu- 
tion of (9). These occur when the differential equation (7) has an integrat- 
ing factor which is either a function of ¢ alone, or a function of y alone. 
Observe that if uw is a function of ¢ alone, then Equation (9) reduces to 


(2m _ aw) 
qu (24 - aw) du dy Ot 
dt "\ dy — ar dt N M 


But this equation is meaningless unless the expression 
0M ON 


dy dt 
N 
is a function of ¢ alone, that is, 


aM _ aN 
dy Or 


rv = R(t). 


If this is the case then a(t) =exp| f R(t) at) is an integrating factor for the 
differential equation (7). 


Remark. It should be noted that the expression 


is almost always a function of both ¢ and y. Only for very special pairs of 
functions M(t,y) and N(t,y) is it a function of ¢ alone. A similar situation 
occurs if «4 is a function of y alone (see Exercise 17). It is for this reason 
that we cannot solve very many differential equations. 


Example 6, Find the general solution of the differential equation 


2 
y- t t Dy 
5 + 2ye +(y te") 0. 


Solution. Here M (t,y)=(y*/2)+2ye! and N(t,y)=y +e‘. This equation is 
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not exact since 0M /dy =y+2e' and dN /dt=e'. However, 


a(t aN) = 28 A 


N oy Or yte' 


Hence, this equation has w(t) =exp| J Ldt) =e" as an integrating factor. 


This means, of course, that the equivalent differential equation 
2 ad 
el + Dye" + (yel+ ce) — =0 


is exact. Therefore, there exists a function $(t,y) such that 


2 
\ yoo 21 _ 9 
(1) 7° +2ye ry 


and 
2t dp 


(ii) ye’+e 
From Equations (1) and (11), 


2 
JY 
o(t,y)= sel tye™ + h(y) 


and 
y? 
(ty) = sel tyes + k(t). 


Thus, h(y)=0, k(t)=0 and the general solution of the differential equation 
1s 


y? 
o(t,y) = sel tyer= C. 


Solving this equation for y as a function of ¢ we see that 
y(th=—-e'+ [e+ 2ce~*]'/, 


Example 7. Use the methods of this section to find the general solution of 
the linear equation (dy /dt)+ a(t)y = b(t). 

Solution. We write this equation in the form M(t,y)+ N(t,y)(dy / dt) =0 
with M(t,y)=a(t)y — b(t) and N(t,y)=1. This equation is not exact since 
0M /dy = a(t) and JN/dt=0. However, ((9M/dy) —(dN/dt))/N = a(t). 


Hence, p(t) =exp f a(t)dt] is an integrating factor for the first-order lin- 


ear equation. Therefore, there exists a function $(t,y) such that 


(i) m(nla(ny-oy = 


65 


1 First-order differential equations 


and 


Gi) w= 


Now, observe from (ii) that $(¢,y)=(t)y + k(t). Differentiating this equa- 
tion with respect to ¢ and using (i) we see that 


w'(t)y + k(t) = w(ta(t)y — w(t)b (4). 
But, w’(t) = a(t) w(t). Consequently, k’(¢)= — w(a)b(t) and 
HEY) = (DY f w(b(t) at. 
Hence, the general solution of the first-order linear equation is 


u(t)y — fule)o(e) dt = c, 


and this is the result we obtained in Section 1.2. 


EXERCISES 


1. Use the theorem of equality of mixed partial derivatives to show that dM/ dy 
= 0N/dt if the equation M (t,y)+ N (t,y)(dy / dt) =0 is exact. 


2. Show that the expression M (t,y)— J (AN (t,y)/0t)dy is a function of ¢t alone if 
0M /dy =0N/ dt. 


In each of Problems 3-6 find the general solution of the given differential 
equation. 


3. 2¢siny +y%et+ (Poosy +3y%e") 2 =0 
y 29) DY 
4. 1+(1+ pe (1+ Pe) = =0 


5. ysectt+secttant+(2y + tant) > =0 


ye 


t oth ® _ 
6. 7 2ye'+(y OT 0 


In each of Problems 7-11, solve the given initial-value problem. 


7. 2ty*+31y*— =0, y()=1 
8. 2tcosy +317y + (4° — 2? siny -y)Z =o, y(0)=2 
2 nD _ _ 
9, 31°+44+(2y+2t aR =O: y(0)=1 
10. y(cos2t)e” —2sin2t)e” +2t+(t(cos2te” -3) 2 =0, y(@O)=0 
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d 
11. 3 +y24+(024 y) = =0, y(2)=1 
In each of Problems 12-14, determine the constant a so that the equation 
is exact, and then solve the resulting equation. 


dy 


2ty 2y = 
12. t+ ye*” + ate 77 0 


at+1 
pg 14,1, 4 _, 


2? yp dt 


14. 6% *¥ 4-327 y72+(2yt> + eat) @ =0 
15. Show that every separable equation of the form M (t)+ N(y)dy /dt=0 is exact. 
16. Find all functions f(t) such that the differential equation 
y*sint + yf (t)(dy/dt)=0 
is exact. Solve the differential equation for these f(t). 
17. Show that if ((0N/dt)—(0M/dy))/M=Q(y), then the differential equation 
M (t,y)+ N(t,y) dy /dt=0 has an integrating factor p(y)= exp( [Q (») 4). 
18. The differential equation f(t)(dy /dt)+ t?+y =0 is known to have an integrat- 
ing factor u(t)=¢. Find all possible functions f(t). 


19. The differential equation e’secy — tany + (dy /dt)=0 has an integrating factor 
of the form e~“ cosy for some constant a. Find a, and then solve the differen- 
tial equation. 


20. The Bernoulli differential equation is (dy/dt)+ a(t)y = b(t)y”. Multiplying 
through by m(e) = exp( f a(s) at), we can rewrite this equation in the form 


d/dt( u(t)y)= b(t) w(t)y”. Find the general solution of this equation by find- 
ing an appropriate integrating factor. Hint: Divide both sides of the equation 
by an appropriate function of y. 


1.10 The existence—uniqueness theorem; Picard iteration 


Consider the initial-value problem 


ad 
SLAY V(t) =Yo (1) 


where f is a given function of ¢ and y. Chances are, as the remarks in Sec- 
tion 1.9 indicate, that we will be unable to solve (1) explicitly. This leads us 
to ask the following questions. 


1. How are we to know that the initial-value problem (1) actually has a 
solution if we can’t exhibit it? 
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2. How do we know that there is only one solution y(t) of (1)? Perhaps 
there are two, three, or even infinitely many solutions. 

3. Why bother asking the first two questions? After all, what’s the use of 
determining whether (1) has a unique solution if we won’t be able to ex- 
plicitly exhibit it? 


The answer to the third question lies in the observation that it is never 
necessary, in applications, to find the solution y(t) of (1) to more than a 
finite number of decimal places. Usually, it is more than sufficient to find 
y(t) to four decimal places. As we shall see in Sections 1.13—17, this can be 
done quite easily with the aid of a digital computer. In fact, we will be able 
to compute y(t) to eight, and even sixteen, decimal places. Thus, the 
knowledge that (1) has a unique solution y(f) is our hunting license to go 
looking for it. 

To resolve the first question, we must establish the existence of a func- 
tion y(t) whose value at ¢=%, is yo, and whose derivative at any time / 
equals f(t,y(t)). In order to accomplish this, we must find a theorem which 
enables us to establish the existence of a function having certain proper- 
ties, without our having to exhibit this function explicitly. If we search 
through the Calculus, we find that we encounter such a situation exactly 
once, and this is in connection with the theory of limits. As we show in Ap- 
pendix B, it is often possible to prove that a sequence of functions y, (¢) has 
a limit y(t), without our having to exhibit y(t). For example, we can prove 
that the sequence of functions 


sin rt + sin 27t + 4 sinnat 


1? 2? n? 
has a limit y(t) even though we cannot exhibit y(t) explicitly. This suggests 
the following algorithm for proving the existence of a solution y(¢) of (1). 


Yn(t)= 


(a) Construct a sequence of functions y,(¢) which come closer and closer 
to solving (1). 

(b) Show that the sequence of functions y,(¢) has a limit y(t) on a suitable 
interval fp5< ¢< iota. 

(c) Prove that y(¢) is a solution of (1) on this interval. 


We now show how to implement this algorithm. 


(a) Construction of the approximating sequence y,(t) 


The problem of finding a sequence of functions that come closer and 
closer to satisfying a certain equation is one that arises quite often in 
mathematics. Experience has shown that it is often easiest to resolve this 
problem when our equation can be written in the special form 


y(th=L(ty(2)), (2) 
where L may depend explicitly on y, and on integrals of functions of y. 
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For example, we may wish to find a function y(Z) satisfying 


y(t)=1+sin[ ¢+y(2) ], 


or 
t 

y(alty2(a+ [ y(s)as. 
0 

In these two cases, L(t,y(t)) is an abbreviation for 


1+sin| ¢+y(t)| 


and 


1+y2()+ f y%(s)abs 


respectively. 

The key to understanding what is special about Equation (2) 1s to view 
L(t,y(t)) as a “machine” that takes in one function and gives back another 
one. For example, let 


L(ty())=1497()+ f'y?(s)as 


If we plug the function y(t)=7¢ into this machine, (that 1s, if we compute 


1+ 122+ [ ‘s3ds) then the machine returns to us the function 1+ 2+ 14/4. If 


0 , . ; . ; 
we plug the function y(t)=cos? into this machine, then it returns to us the 
function 


3 
L+cos’t+ [cos’sds=1+cos*r + sint— sin 
0 3 
According to this viewpoint, we can characterize all solutions y(t) of (2) as 
those functions y(t) which the machine L leaves unchanged. In other 
words, if we plug a function y(t) into the machine L, and the machine re- 
turns to us this same function, then y(¢) is a solution of (2). 
We can put the initial-value problem (1) into the special form (2) by in- 
tegrating both sides of the differential equation y’ = f(t,y) with respect to ¢. 
Specifically, if y(t) satisfies (1), then 


[ Pe [rose 


lo 


so that 
y()=yo+ f S(s.v(s)) 4s (3) 


Conversely, if y(t) is continuous and satisfies (3), then dy /dt=f(t,y(2)). 
Moreover, y(fo) is obviously yo. Therefore, y(t) is a solution of (1) if, and 
only if, it is a continuous solution of (3). 
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Equation (3) is called an integral equation, and it is in the special form 
(2) if we set 


L(t.»())= vot [ F(s.7(s))a 


This suggests the following scheme for constructing a sequence of “ap- 
proximate solutions” y, (¢) of (3). Let us start by guessing a solution y,(‘) of 
(3). The simplest possible guess is yo(t)=yo. To check whether y,(¢) is a 
solution of (3), we compute 


ni(D=ror | "f(s,y9(s)) ds. 


If y,(Q=yo, then y(t)=y, is indeed a solution of (3). If not, then we try 
y(t) as our next guess. To check whether y,(f) is a solution of (3), we com- 
pute 


y,(t)=yo+ f “f (sy (s)) ds, 


and so on. In this manner, we define a sequence of functions y,(t), 
y,(t),..., where 


Ynail)=¥or { S(s.rn(s))a (4) 


These functions y,(t) are called successive approximations, or Picard 
iterates, after the French mathematician Picard who first discovered them. 
Remarkably, these Picard iterates always converge, on a suitable interval, 
to a solution y(t) of (3). 


Example 1. Compute the Picard iterates for the initial-value problem 


y=y,  y(0)=1, 
and show that they converge to the solution y(t)=e'. 
Solution. The integral equation corresponding to this initial-value problem 
is 
t 
y(=1+ f y(s)ds. 
0 
Hence, y(t) = 1 
t 
yi (Q=1+ f lds=1+t 
0 


{2 


1+ ['y,(s)\ds=14 [(‘(14+5)ds=1 
y,(t)= +f vi(s) s= +f (+s) salttt 
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and, in general, 


t / gs?! 
ya(t)=14 yn1(8)d5= 1+ ltst..+ oy [as 
_ t? t” 
=Itita + to 


Since e'=1+1f+17/2!+..., we see that the Picard iterates y,(t) converge 
to the solution y(t) of this initial-value problem. 


Example 2. Compute the Picard iterates y,(t),y,(t) for the initial-value 
problem y’=1+y?, y(I)=1. 

Solution. The integral equation corresponding to this initial-value problem 
1S 


=14+ fli+y 
y(t) + f [1+y°(s) | ds. 
Hence, y)(¢) = 1 


yi(t=lt [14 1ds=1+2(0-1) 
l 


and 


yagi f {1+ [1426-)P } a 


= 1+2(t—1)+3(t—1)? +4(¢-1)° +2(¢- 1). 


Notice that it is already quite cumbersome to compute y,(Z). 


(b) Convergence of the Picard iterates 


As was mentioned in Section 1.4, the solutions of nonlinear differential 
equations may not exist for all time ¢. Therefore, we cannot expect the 
Picard iterates y,(¢) of (3) to converge for all ¢. To provide us with a clue, 
or estimate, of where the Picard iterates converge, we try to find an inter- 
val in which all the y,(f) are uniformly bounded (that is, |y,(¢)|< K for 
some fixed constant K). Equivalently, we seek a rectangle R which con- 
tains the graphs of all the Picard iterates y(t). Lemma | shows us how to 
find such a rectangle. 


Lemma 1. Choose any two positive numbers a and b, and let R be the rectan- 
gle: th< t< tot a,|y—yo| < b. Compute 
; b 
M= aX \f(ty)|, and set a = min(a, uM ) 
Then, 
IYn(t) —Yol < M(t ~ to) (3) 
for th t<tota. 
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Lemma | states that the graph of y,(¢) is sandwiched between the lines 
Y=yot M(t—t)) and y=yy>—-M(t—t,), for t) << t<t)+a. These lines 
leave the rectangle R at t=1)+a if a<b/M, and at t=14,+b/M if b/M 
<a (see Figures la and 1b). In either case, therefore, the graph of y, (7) 1s 
contained in R for tg <t<tgta. 


y+b 
x) 
‘e 
Xo 
y—b 
t t 
to t ta t tt a 
(a) (b) 


Figure 1. (a) a=a; (b)a=b/M 


PROOF OF LEMMA |. We establish (5) by induction on n. Observe first that 
(5) is obviously true for n=0, since y)(t)=)>. Next, we must show that (5) 
is true for n=/+1 if it is true for n=/. But this follows immediately, for if 
|¥(D—Yol < M (¢— to), then 


yoal) role f1(50960)) 


t 
< [ |F(s.9,(s))|ds < M (t— 10) 
to 
for fg < ¢ < tg +a. Consequently, (5) is true for all n, by induction. a 


We now show that the Picard iterates y,(t) of (3) converge for each ¢ in 
the interval tp <¢< t)+ a, if df/dy exists and is continuous. Our first step is 
to reduce the problem of showing that the sequence of functions y, (1) con- 
verges to the much simpler problem of proving that an infinite series con- 
verges. This is accomplished by writing y,(t) in the form 

Yn(t) =Yo(t) + [ ¥1(t)—yo(t) | +... + [ ¥n(t) =Yn—1 (2) ]- 


Clearly, the sequence y, (tf) converges if, and only if, the infinite series 


[y1(t)-yo(2)] + [ y2(t)—-1 (1) | +...4+[y()-ry-i)] +... (6) 
converges. To prove that the infinite series (6) converges, it suffices to 
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show that 


D |Ynlt)-Yn-1(1)| <0. (7) 
n=] 
This is accomplished in the following manner. Observe that 
t 
yal) Pall] f TF(6%- 1) ~F64-a9)] 
0 


<f V(SIn1(3)) —F(S.n2(3))] 4s 


-[ 


0 


of (s,&(s)) 


yn 165)-Yn-2ls)| as 


where &(s) lies between y,_,(s) and y,_.,(s). (Recall that f(x,)—f(x,)= 
f'(§(*,— x2), where € is some number between x, and x,.) It follows im- 
mediately from Lemma | that the points (s,&(s)) all lie in the rectangle R 
for s<t)+a. Consequently, 


t 
a(n COIS LY Yn-i(8)-Yn-2()14 oS t< Hota, (8) 
0 


where 
of(ty 
L= y) ; (9) 
(4y)inR| oy 
Equation (9) defines the constant L. Setting n=2 in (8) gives 
t t 
Iv2(t)- vi (I< L{ yi(s)—yolds< Lf M(s—to)ds 
to to 
LM (t— to) 
=—,—. 
This, in turn, implies that 
t 2 
t (5— to) 
a(D—ra(OI< Lf |y2(s)—y1(s)|ds< ML? il > as 
to to 
ML?(t—t)) 
~ 3b 
Proceeding inductively, we see that 
ML"~! (t- to)” 
| y(t) -—y,—1(2)| < ——————_,,_ for fg < t< tyta. (10) 


n! 


73 


1 First-order differential equations 


Therefore, for (7< ¢< tg +a, 

yi) yo +ly2(O -—i I+ --. 

ML(t~t) | ML?(t~ to) 
+ —$—_$_$ 


<M (t—t))+ T rT 
< Ma+ Moe + Mio +... 
MI 4 (aby , (aby 
L 2! 3! 
-4 e** —]), 


This quantity, obviously, is less than infinity. Consequently, the Picard 
iterates y,(¢) converge for each ¢ in the interval t)<¢<t)+a. (A similar 
argument shows that y,(t) converges for each ¢ in the interval p-B<ir< 
to, Where B=min(a,b/N), and N is the maximum value of | f(t,y)| for 
(t,y) in the rectangle tj —-a<t< to,|y —yo| < b.) We will denote the limit of 
the sequence y, (t) by y(Z). O 


(c) Proof that y(t) satisfies the initial-value problem (1) 
We will show that y(¢) satisfies the integral equation 
t 
y(t)=yor [ F(s.r(s))as (11) 
Lo 


and that y(¢) is continuous. To this end, recall that the Picard iterates y, (t) 
are defined recursively through the equation 


t 
Yn ()=Vot { F(sYn(s)) a (12) 
0 
Taking limits of both sides of (12) gives 
. t 
y(t)=yo+ jim [ f(s,yq(s)) as. (13) 
fo 
To show that the right-hand side of (13) equals 


yor f “f(s,y(s)) ds, 


(that is, to justify passing the limit through the integral sign) we must show 
that 


[ fex@))as- J S(s.ra(s)) 4s 


approaches zero as n approaches infinity. This is accomplished in the 
following manner. Observe first that the graph of y(t) lies in the rectangle 
R for t< tg +a, since it is the limit of functions y,(¢) whose graphs lie in R. 


74 


1.10 The existence—uniqueness theorem; Picard iteration 


Hence 


[Her a~ f'oon()es 


< fLPs.() Fra M45 < Lf 1y(s) ral) 


where L is defined by Equation (9). Next, observe that 


y(s)—y,(8)= Sf y(s)—»-18)] 


j=zntl 
since 


yiarot D [y)-y-00) 
and 


n(s)=V0+ B [H) 10} 


Consequently, from (10), 


y(s)-», (<M >) LI 
jJ=ntl 
ore 00 Jj 
a M (aL) 
<m > w= S 7 (14) 
j=ntil j=ntl 


and 


<M Ss ey —— fds 


j=ntl 0 


[Horna Fela 


oo J 
(aL) 
< Ma > i 
j=ntl 


This summation approaches zero as n approaches infinity, since it is the 
tail end of the convergent Taylor series expansion of e*”. Hence, 


Jim f F(s.y,(s))ds= [F(s.9(s))as 


and y(t) satisfies (11). 
To show that y(t) is continuous, we must show that for every e>0 we 
can find 6 >0 such that 


ly(tth)—y(t)|<e if |hA|<6. 
Now, we cannot compare y(t+h) with y(s) directly, since we do not know 
y(t) explicitly. To overcome this difficulty, we choose a large integer N and 
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observe that 
y(tth)—y(t)= | y(t+h)—yy(t+A)] 
+[ yy (t+h)—yy(t)]+[ ov ()—-y (9) ]- 
Specifically, we choose N so large that 
M wa (aly ¢ 
L >> 7 S3° 
jJ=HN+1 
Then, from (14), 
IV(tth)—yy(tth)|<z and |yy(N—-y(Q<F. 


for t<4+ a, and h sufficiently small (so that ¢+h<t)+a.) Next, observe 
that y,(z) is continuous, since it is obtained from N repeated integrations 
of continuous functions. Therefore, we can choose 6 >0 so small that 


lyv (t+ h)—yy(t)| <5 for |A|<6. 


Consequently, 
ly(t+h)—y(t)] < [y+ A) —yy (t+ A) + [yy (t+ 4) — yy (OI 


€,€, € _ 
+| yy (t) y(Ni<3 + 3 + 3 —=eE 


for |h| <6. Therefore, y(f) is a continuous solution of the integral equation 
(11), and this completes our proof that y(t) satisfies (1). a 


In summary, we have proven the following theorem. 


Theorem 2. Let f and of/dy be continuous in the rectangle R:ty<t< tyta, 
|¥ —Yol < b. Compute 


; b 
M= ty) = on B 
ax AIF y)|, andset a min(a | 


Then, the initial-value problem y’=f(t,y), y(to) =) has at least one solu- 
tion y(t) on the interval tj<t<tgt+a. A similar result is true for t < to. 


Remark. The number a in Theorem 2 depends specifically on our choice 
of a and b. Different choices of a and b lead to different values of a. 
Moreover, a doesn’t necessarily increase when a and b increase, since an 
increase in a or b will generally result in an increase in M. 


Finally, we turn our attention to the problem of uniqueness of solutions 
of (1). Consider the initial-value problem 


d 
— =(sin21)y'7, (0) =0. (15) 
One solution of (15) is y(t)=0. Additional solutions can be obtained if we 
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ignore the fact that y(0)=0 and rewrite the differential equation in the 
form 


1 d_. 

yi i =sin2t, 
or 

d3y 

a) =sin2t. 
Then, 

2/3 _ 
x _! = 2t _ sin? £ 


and y= + V 8/27 sin°¢ are two additional solutions of (15). 

Now, initial-value problems that have more than one solution are 
clearly unacceptable in applications. Therefore, it is important for us to 
find out exactly what is “wrong” with the initial-value problem (15) that it 
has more than one solution. If we look carefully at the right-hand side of 
this differential equation, we see that it does not have a partial derivative 
with respect to y at y=0. This is indeed the problem, as the following theo- 
rem shows. 


Theorem 2’. Let f and df/ dy be continuous in the rectangle R:ty<t< tot, 
|¥ —Yol < 6. Compute 


. b 
M= ty )I, dset a= neal 
max IC y)|, andset a min( a 7} 


Then, the initial-value problem 


V=f(ty),  V(to) =Yo (16) 
has a unique solution y(t) on the interval ty< t< tg+ a. In other words, if 
y(t) and z(t) are two solutions of (16), then y(t) must equal z(t) for tg<t 
<lota. 


PRooF. Theorem 2 guarantees the existence of at least one solution y(t) of 
(16). Suppose that z(¢) is a second solution of (16). Then, 


v(N=yot [ f(s.v(s))as and (= vot f S(s2(s))as 


Subtracting these two equations gives 


-2001=|f'[1oo(9)~F62(9)] 4 


<{ 1f(s9(9))-F(s2())las 
< Lf '\y(s)-2(s)las 
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where L is the maximum value of |df/dy| for (t,y) in R. As Lemma 2 be- 
low shows, this inequality implies that y(t)= z(t). Hence, the initial-value 
problem (16) has a unique solution y(7¢). 0 


Lemma 2. Let w(t) be a nonnegative function, with 


w(t)< Lf w(s) ds (17) 
Lo 
Then, w(t) is identically zero. 


FAKE PROOF. Differentiating both sides of (17) gives 


dw dw 
i <Lw(t), or ” Lw(t) <0. 


Multiplying both sides of this inequality by the integrating factor e~*“~ 
gives 
LO <0, sothat e~!“—-w(r) < w(t) 


for ¢ > tp. But w(t)) must be zero if w(¢) is nonnegative and satisfies (17). 
Consequently, e~£“~ w(t) <0, and this implies that w(t) is identically 
Zero. 


The error in this proof, of course, is that we cannot differentiate both 
sides of an inequality, and still expect to preserve the inequality. For exam- 
ple, the function f,(¢)=2t-—2 is less than f,(t)=7¢ on the interval [0, 1], but 
Fi(t) is greater than f,(¢) on this interval. We make this proof “kosher” by 
the clever trick of setting 


U(1)= 'w(s) ds. 
Then, 


a =w(t)< Lf w(s)ds= LU (i). 


Consequently, e~*"~U(t) < U(t))=0, for t> fp, and thus U(t)=0. This, 
in turn, implies that w(t)=0 since 


0< w(t) < Lf w(s)ds= LU(1)=0. g 


Example 3. Show that the solution y(t) of the initial-value problem 


ay 2 2 
a y — 
a =f“+e”’, y(0)=0 


exists for 0<¢< $, and in this interval, | y(2)| < 1. 
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Solution. Let R be the rectangle O<t< > |y| <1. Computing 


2 
M= max P+e”=14+(5) =3, 
(t,y)in R 


we see that y(¢) exists for 


e l ] l 
< < —_  ——___ — 
OK<t min( 544 5? 

and in this interval, | y()| <1. 


Example 4. Show that the solution y() of the initial-value problem 
dy —? 3 
a e' +y?, y(0)=1 


exists for 0< t< 1/9, and in this interval, 0< y <2. 
Solution. Let R be the rectangle 0< t< 3, O< y <2. Computing 


M= max e-°+y3=1+23=9, 
(t,y)in R 
we see that y(t) exists for 
O0<r¢<min(5,5) 


and in this interval, 0< y <2. 


Example 5. What is the largest interval of existence that Theorem 2 pre- 
dicts for the solution y(#) of the initial-value problem y’=1+y, y(0)=0? 
Solution. Let R be the rectangle 0< ¢< a, |y| <b. Computing 


M= max 1+y?=1+5?, 
(t,y)in R 


we see that y(¢) exists for 
; b 
O<t< a=min{ a,—*— 
1+}? 


Clearly, the largest a that we can achieve is the maximum value of the 
function b/(1+5?). This maximum value is +. Hence, Theorem 2 predicts 
that y(t) exists for 0<t< 4. The fact that y(t)=tant exists forO<t<a7/2 
points out the limitation of Theorem 2. 


Example 6. Suppose that | f(t,y)|< K in the strip t)< t<00, -o<y<o. 
Show that the solution y(t) of the initial-value problem y’=/(t,y), y(to) = 
Yo exists for all ¢> to. 

Solution. Let R be the rectangle 49 < t< ty) +a, | y—yo| <b. The quantity 


M= max t, 
am ix |F(4y)) 


79 


1 First-order differential equations 


is at most K. Hence, y(¢) exists for 
to<t<to+min(a,b/K). 


Now, we can make the quantity min(a,b/K) as large as desired by choos- 
ing a and 5b sufficiently large. Theretore y(¢) exists for t > fo. 


EXERCISES 


1. Construct the Picard iterates for the initial-value problem y’=2t(y + 1), y(0)= 
0 and show that they converge to the solution y(t)= e?—], 


2. Compute the first two Picard iterates for the initial-value problem y’= 7? +7, 
y(0)=1. 


3. Compute the first three Picard iterates for the initial-value problem y’ =e’ + y?, 
y(0)=0. 


In each of Problems 4-15, show that the solution y(t) of the given initial- 
value problem exists on the specified interval. 


4. y'=y?+cos??, y(0)=0; O<r<} 

5. y=l+yt+y’cost, y(0)=0; O0<r<} 

6. =t+y, v0) =0;0 <1 < 4) 

7. y=e""+y2, y(0)=0; O<r<i 

8. y’=e- + y2, Wl) =0; 1 <t< 1+ Ve /2 
V2 

1+(1+V2) 

10. y’=yt+te *+e ', yO=0; O<t<l 


9, =e +y?, yO=H1; O<r< 


ll. y=y3te*, y(0)=04; O<t<3 


10 


12. y’=e"-", yO)=1; O<r< vs al eo ~(i+ V3)/2yP 


13. y’'=(4y+e-“)e”, y(0)=0; O<r< ; 


] 
e 


14. y’'=e"'+In(l+y?), yO)=0; 0<t<0 
15. y’ = 4(1+cos4/)y — 3,(1—cos4t)y*, y(0)=100; O<r<l 
16. Consider the initial-value problem 
y=tty?, — y(0)=0, (*) 


and let R be the rectangle O< t<a, —b< y<b. 
(a) Show that the solution y(t) of (*) exists for 


O< r<min(a, =). 
a* +b? 
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(b) Show that the maximum value of b/(a?+ b?), for a fixed, is 1/2a. 
(c) Show that a=min(a, 1a) is largest when a=1/V2. 
(d) Conclude that the solution y(?) of (*) exists forO<t<1/V2. 
17. Prove that y(t)= — 1 is the only solution of the initial-value problem 
y=t+y), yO=—1. 
18. Find a nontrivial solution of the initial-value problem y’=ty%, y(0)=0, a> 1. 
Does this violate Theorem 2’? Explain. 


19. Find a solution of the initial-value problem y’=¢\/ 1—y? , y(0)=1, other than 
y(t)=1. Does this violate Theorem 2’? Explain. 


20. Here is an alternate proof of Lemma 2. Let w(t) be a nonnegative function 
with 
t 
w(t)< Lf w(s)ds (*) 
fo 


on the interval f9 < t< t9+ a. Since w(t) is continuous, we can find a constant 

A such that 0< w(t) <A for tg<t<itgta. 

(a) Show that w(t) < LA(t— tp). 

(b) Use this estimate of w(t) in (*) to obtain 

AL*(t — to) 

a, 

(c) Proceeding inductively, show that w(t) < AL"(t—tp)"/n!, for every integer 
n. 

(d) Conclude that w(t)=0 for tg <t<tgta. 


w(t)< 


1.11 Finding roots of equations by iteration 


Suppose that we are interested in finding the roots of an equation having 
the special form 


x= f(x). (1) 
For example, we might want to find the roots of the equation 
x=sinx +4. 
The methods introduced in the previous section suggest the following algo- 


rithm for solving this problem. 


1. Try an initial guess x9, and use this number to construct a sequence of 
guesses X),X2,X3,..., where x,;=f(Xo), x, =f(x,), X;= f(x), and so on. 

2. Show that this sequence of iterates x, has a limit 7 as n approaches in- 
finity. 

3. Show that 7 is a root of (1); ie., 7=f(n). 


The following theorem tells us when this algorithm will work. 


Theorem 3. Let f(x) and f'(x) be continuous in the interval a< x < b, with 
|.f’(x)| <A <1 in this interval. Suppose, moreover, that the iterates x,, de- 
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fined recursively by the equation 

Xna1 =F (%,) (2) 
all lie in the interval [a,b]. Then, the iterates x,, converge to a unique num- 
ber » satisfying (1). 


PRooF. We convert the problem of proving that the sequence x, converges 
to the simpler problem of proving that an infinite series converges by writ- 
ing x, in the form 


Xp = Xqt(X— Xo) + (%_— X) + «0 HC, X11): 
Clearly, the sequence x, converges if, and only if, the infinite series 
oe) 
(X1— Xo) +(%.—X,) +... +, Hy) Fe = > (Xn Xn—1) 
n=} 


converges. To prove that this infinite series converges, it suffices to show 
that 


ioe) 
Jx1— Xo + |xg— y+... = D [X— Xp-1] <0. 
n=} 


This is accomplished in the following manner. By definition, x, =f(x,_ 1) 
and x,_,=f/(x,_,). Subtracting these two equations gives 


Xn %n-1 =f (Xn- 1) —f (X_-2) =f'(E)(Xn-1 ~ Xn—2)s 


where € is some number between x, _, and x,_>. In particular, € is in the 
interval [a,b]. Therefore, | f’(§| <A, and 


Xn ~ Xp al SAL %q_— 1 Xp—al- (3) 
Iterating this inequality n—1 times gives 
|x, ~ Xn- 1 < A| x, — 1 kn 1 


< M*|x, 27 Xn—3| 


<A" "1x,— Xp]. 


Consequently, 


00 ore) 
> |X,—X,—11< > A" "x1 — x9] 
n=] 


n=] 
=|x,— Xo|[[1+A+A24+...] = 


This quantity, obviously, is less than infinity. Therefore, the sequence of 
iterates x, has a limit y as n approaches infinity. Taking limits of both 
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sides of (2) gives 
n= lim x,41= Jim, f(x,)=S(n). 


Hence, 7 1S a root of (1). 
Finally, suppose that 7 is not unique; that is, there exist two solutions 7, 
and 7, of (1) in the interval [a,b]. Then, 


™— 2.=f(m) —f (M2) =f'(E)(m — 1); 


where € is some number between 7, and 7,. This implies that 1, = n, or f’(§) 
= 1. But f’(€) cannot be one, since € is in the interval [a,b]. Therefore, 7, = 


N2- C 
Example 1. Show that the sequence of iterates 
Xp X,=1+4arctanxy, x ,=1+ Jarctanx,,... 
converge to a unique number 7 satisfying 
n=1+ Jarctann 


for every initial guess Xp. 

Solution. Let f(x)=1+4arctanx. Computing f’(x)=41/(1+ x”), we see 
that |f’(x)| is always less than or equal to 3. Hence, by Theorem 3, the 
sequence of iterates x9,x,,X>,... converges to the unique root 7 of the 


equation x = 1+ arctanx, for every choice of Xp. 


There are many instances where we know, a priori, that the equation 
x= f(x) has a unique solution y in a given interval [a,b]. In these in- 
stances, we can use Theorem 3 to obtain a very good approximation of 7. 
Indeed, life is especially simple in these instances, since we don’t have to 
check that the iterates x, all lie in a specified interval. If x9 is sufficiently 
close to n, then the iterates x, will always converge to n, as we now show. 


Theorem 4. Assume that f(n)=%, and that | f’(x)|<A<1 in the interval 
|x — |< a. Choose a number xq in this interval. Then, the sequence of 
iterates x,, defined recursively by the equation x,,,,=f(x,,), will always 
converge to 7. 


Proor. Denote the interval |x— |< a by J. By Theorem 3, it suffices to 
show that all the iterates x, lie in J. To this end, observe that 


X41 N=S(%) SF (m) =f'(O)(0G5- 0) 


where is some number between x, and 7. In particular, is in J if x; is in 
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I. Thus, 
X41 | SAX; — nl <|x;— 9] (4) 


if x, is in J. This implies that x,,, is in J whenever x, is in J. By induction, 
therefore, all the iterates x, lie in J. CO 


Equation (4) also shows that x,,,, is closer to n than x,. Specifically, the 
error we make in approximating 7 by x, decreases by at least a factor of A 
each time we increase n. Thus, if A is very small, then the convergence of x,, 
to 7 is very rapid, while if A is close to one, then the convergence is very 
slow. 


Example 2. 
(a) Show that the equation 


x=sinx+4 (5) 


has a unique root 7 in the interval [7 /4,7/2]. 
(b) Show that the sequence of numbers 


— 1 _— 1 
Xo X,=SMXp+ GZ, X2=SINX, +]... 


will converge to 7 if 7/4<x)< 7/2. 

(c) Write a computer program to evaluate the first N iterates x,,x>,...,Xy- 
Solution. 

(a) Let g(x)=x-—sinx— i, and observe that g(7/4) is negative while 
g(7/2) is positive. Moreover, g(x) is a monotonic increasing function of x 
for 7/4< x<7/2, since its derivative is strictly positive in this interval. 
Therefore, Equation (5) has a unique root x= 7 in the interval 7/4<x< 
a /2. 

(b) Let J denote the interval n—7/4<x<1n+7/4. The left endpoint of 
this interval is greater than zero, while the right endpoint is less than 37/4. 
Hence, there exists a number A, with O<A< 1, such that 


<i 


|cos x| = | £ (sinx + 1) 


for x in J. Clearly, the interval [7/4,7/2] is contained in J. Therefore, by 
Theorem 4, the sequence of numbers 


Xo Xy=SiINXpt+ 5, X,=sinx, + 4,... 
will converge to y for every xq in the interval [7 /4,7/2]. 
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(C) 


Pascal Program 


Program Iterate (input, output); 
var 

X: array[0..199] of real; 

k, N: integer; 


begin 
readin(X[0], N); 
page, 
writeln(‘N’:4, ‘X[N]':14); 
for k :=0 to N do 
begin 
writeln(K:4, ‘ °:4, X[k]:17:9); 
X[k + 1] :=0.25 + sin (X[k]); 
end; 
end. 


Fortran Program 


DIMENSION X(200) 
READ (5,10) X0,N 
10 FORMAT (F15.8, |5) 
C COMPUTE X(1) FIRST 
X(1) = 0.25 + SIN(XO) 
KA=0 
KB=1 
WRITE (6, 20) KA, XO, KB, X(1) 
20 FORMAT (1H1,4X, ‘N’, 10X, ‘X’/(1H, 3X, 13, 4X, F15.9)) 
C COMPUTE X(2) THRU X(N) 
DO 40 K=2,N 
X(K) = 0.25 + SIN(X(K — 1)) 
WRITE (6, 30) K, X(K) 
30 FORMAT (1H, 3X, 3, 4X, F15.9) 
40 CONTINUE 
CALL EXIT 
END 


See also C Program 1 in Appendix C for a sample C program. 
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Table 1 


n 

0 117110411 
l 1.09147099 1.17122962 
2 1.13730626 1.17122964 
3 1.13750531 1.17122965 
4 1.16580403 1.17122965 
5 1.16910543 1.17122965 
6 1.17040121 1.17122965 
7 1.17090706 1.17122965 


In many instances, we want to compute a root of the equation x = f(x) 
to within a certain accuracy. The easiest, and most efficient way of accom- 
plishing this is to instruct the computer to terminate the program at k =; if 
X41 agrees with x, within the prescribed accuracy. 


EXERCISES 


1. Let y be the unique root of Equation (5). 
(a) Let x9=7/4. Show that 20 iterations are required to find 7 to 8 significant 
decimal places. 
(b) Let x9=27/2. Show that 20 iterations are required to find y to 8 decimal 
places. 
(c). Let x9 =37/8. Show that 16 iterations are required to find n to 8 decimal 
places. 


2. (a) Determine suitable values of x9 so that the iterates x,, defined by the equa- 
tion 
Xn+1 = Xn +(%n?—2) 
will converge to V2. 


(b) Choose x9 = 1.4. Show that 14 iterations are required to find V2 to 8 signifi- 
cant decimal places. (V2 = 1.41421356 to 8 significant decimal places.) 


3. (a) Determine suitable values of xp so that the iterates x,, defined by the equa- 
tion 
Xn+1 = Xn — o(Xn? —2) 
will converge to V2. 


(b) Choose x= 1.4. Show that 30 iterations are required to find V2 to 6 signifi- 
cant decimal places. 


4, (a) Determine a suitable value of a so that the iterates x,, defined by the equa- 
tion 
Xn41=X_—a(xX,?—-3), xg = 1.7 
will converge to V3 . 
(b) Find V3 to 6 significant decimal places. 
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5. Let n be the unique root of the equation x =1+4 arctanx. Find 7 to 5 signifi- 
cant decimal places. 


6. (a) Show that the equation 2— x =(Inx)/4 has a unique root x = 7 in the inter- 
val 0< x < 00. 
(b) Let 
Xn4) =2—(Inx,)/4, n=0,1,2,... 
Show that 1< x, <2 if 1<xg<2. 


(c) Prove that x,-7 as n-00 if 1< x <2. 
(d) Compute 7 to 5 significant decimal places. 


7. (a) Show that the equation x =cos.x has a unique root x = 7 in the interval 0< x 
<1. 
(b) Let x,4,;=cosx,, n=0,1,2,..., with 0< x9< 1. Show that 0< x, <1. Con- 
clude, therefore, that x,—17 as n-00. 
(c) Find y to 5 significant decimal places. 


1.11.1 Newton’s method 


The method of iteration which we used to solve the equation x = f(x) can 
also be used to solve the equation g(x)=0. To wit, any solution x=7 of 
the equation g(x)=0 is also a solution of the equation 

x= f(x)=x— g(x), (1) 
and vice-versa. Better yet, any solution x=7 of the equation g(x)=0 is 
also a solution of the equation 
g(x) 
h(x) 


for any function h(x). Of course, h(x) must be unequal to zero for x near 


x=f(x)=x- (2) 


Equation (2) has an arbitrary function h(x) in it. Let us try and choose 
h(x) so that (1) the assumptions of Theorem 4, Section 1.11 are satisfied, 
and (11) the iterates 


x 2 (Xp) w=x- g(x,) 
A(x) 7 h() 


converge as “rapidly as possible” to the desired root 7. To this end, we 
compute 


XQ, X= 


Co df BOD] of). a(x) 
rey fs Tole h(x) A(x) 


and observe that 


g’(n) 


I(m=1— To 
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This suggests that we set h(x)=g'(x), since then f’(n)=0. Consequently, 
the iterates x,, defined recursively by the equation 

x ax, - 2 n=0,1,2,... (3) 

8'(X,) 

will converge to 7 if the initial guess x, is sufficiently close to 7. (If f’(n)= 
0, then | f’(x)| <A <1 for |x—‘| sufficiently small.) Indeed, the choice of 
h(x)=f"(x) is an optimal choice of h(x), since the convergence of x, to 7 
will be extremely rapid. This follows immediately from the fact that the 
number A in Equation 4, Section 1.11 can be taken arbitrarily small, as x, 
approaches 7. 

The iteration scheme (3) is known as Newton’s method for solving the 
equation g(x)=0. It can be shown that if g(n)=0, and x, is sufficiently 
close to 7, then 

IXna1— 1 <clx,— a)’, 
for some positive constant c. In other words, the error we make in ap- 
proximating 7 by x,,, 18 proportional to the square of the error we make 
in approximating 7 by x,. This type of convergence is called quadratic con- 
vergence, and it implies that the iterates x, converge extremely rapidly to 
n. In many instances, only five or six iterations are required to find 7 to 
eight or more significant decimal places. 


Example 1. Use Newton’s method to compute V2. 
Solution. The square root of two 1s a solution of the equation 


g(x)=x?-2=0. 
Hence, Newton’s scheme for this problem is 
eric (x, —2) 
Smet) 
xX, 1 
=Zz +> n=0,1,2,.... (4) 


Sample Pascal and Fortran programs to compute the first N iterates of an 
initial guess x, are given below. 


Pascal Program 


Program Newton (input, output); 
var 

X: array(0.. 199] of real; 

k, N: integer; 
begin 

readin(X(0}, N); 

page; 
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writeln(‘N’:4, “X[N]’:14); 
for k :=0 to N do 
begin 
writeln(K:4, ‘ °:4, X[k]:17:9); 
X[k + 1] :=X[k]/2 + 1/X[k]; 
end; 
end. 


Table 1 


1.4 1.41421356 
1.41428571 1.41421356 
1.41421356 1.41421356 


Fortran Program 


We need only replace the instructions for computing X(1) and X(K) in the 
Fortran program of Section 1.11 by 
X(1)=(X0/2)+1/X0 
and 
X(K) =(X(K — 1) /2)+1/X(K—1) 


We ran these programs for x)= 1.4 and N=S, and the results are given in 
Table 1. Notice that Newton’s method requires only 2 iterations to find 


V2 to eight significant decimal places. 
See also C Program 2 in Appendix C for a sample C program. 


Example 2. Use Newton’s method to find the impact velocity of the drums 


in Section 1.7. 
Solution. The impact velocity of the drums satisfies the equation 


g(v)=ot 


300 - _B- 
Ww Bin| # B 2 ]=0 5) 


Ww WB 
where 
c=0.08, g=32.2, W=527.436, and B=470.327. 
Setting a=(W-— B)/c and d=300cg/W puts (5) in the simpler form 
g(v)=o+d+aln(1—v/a)=0. (6) 
Newton’s iteration scheme for this problem is 
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g(v,) (1—v,/a)[ 0, + d+ aln(1—v,/a)] 
41 =O, 7 =0, + 
" g(v,) v,/a 


Sc 


a-v, 
=v, + — lo, +dtain(i—v,/a)], n=0,1,2,.... 


n 
n 


Sample Pascal and Fortran programs to compute the first N iterates of ug 
are given below. 


Pascal Program 


Program Newton (input, output); 


const 
c =0.08; 
g =32.2; 
W = 527.436; 
B = 470.327; 


var 
V: array[0..199] of real; 
a, d: real: 
k, N: integer; 


begin 
readin(V[0], N); 
a:=(W—B)/c; 
d :=300«c*g/W; 
page; 
writeln(‘N’:4, ‘V[N]’:14); 
for k:=0 to Ndo 
begin 
writeln(K:4, ‘ °:4, V[k]:17:9); 
V(k+ 1] :=V[k] + ((a—V[k])/V[k]) 
+ (V[k] +d+a*In(1—(V[k]/a))); 
end; 
end. 


Fortran Program 


Change every X to V, and replace the instructions for X(1) and X(K) in the 
Fortran program of Section 1.11 by 


V(1)=V0O+((A — VO)/V0)*(VO+D+A*A LOG(1 — (VO /A))) 
and 
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V(K) =V(K—1)+((A—V(K—1)) /V(K — 1) (VK —1)+D 
+A*xALOG(1—(V(K—1) /A))) 


(Before running these programs, of course, we must instruct the computer 
to evaluate the constants a=(W— B)/c and d=300 cg/W.) 

As was shown in Section 1.7, v9 =45.7 is a very good approximation of 
v. We set v9 =45.7 in the above programs, and the iterates v, converged 
very rapidly to v=45.1 ft/s. Thus, the drums can indeed break upon im- 
pact. 


In general, it is not possible to determine, a priori, how many iterations 
will be required to achieve a certain accuracy. In practice, we usually take 
N very large, and instruct the computer to terminate the program if one of 
the iterates agrees with its predecessor to the desired accuracy. 

See also C Program 3 in Appendix C for a sample C program. 


EXERCISES 


1, Show that the iterates x, defined by (4) will converge to V2 if 
V2/3 <x9<V2 +(V2 -V2/3 ). 
2, Use Newton’s method to find the following numbers to 8 significant decimal 
places. (a) V3 , (b) V5, (c) V7. 
3. The number 7 is a root of the equation 
x x _ 
tanz — cot7 =(), 


Use Newton’s method to find 7 to 8 significant decimal places. 


Show that each of the following equations has a unique solution in the 
given interval, and use Newton’s method to find it to 5 significant decimal 
places. 


4. 2x—tanx=0; a<x<30/2 5. $-x+tsinx=0; i<x<l 
6. Inx+(x+1)?=0; O0<x<l 7. 2Vx =cos 5; O<x<l 
8. (x— 1) —+e*=0; 0<x<l 9. x-e-* =I: 0<x <2. 


1.12 Difference equations, and how to compute 
the interest due on your student loans 


In Sections 1.13-1.16 we will construct various approximations of the solu- 
tion of the initial-value problem dy /dt=f*(t,y), y(t) =o. In determining 
how good these approximations are, we will be confronted with the follow- 
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ing problem: How large can the numbers £,,..., £,, be if 


E,,,<AE,+B,  n=0,1,...,.N—-1 (1) 


n 


for some positive constants A and B, and E,=0? This is a very difficult 
problem since it deals with inequalities, rather than equalities. Fortunately, 
though, we can convert the problem of solving the inequalities (1) into the 
simpler problem of solving a system of equalities. This is the content of the 
following lemma. 


Lemma 1. Let E,,...,Ey satisfy the inequalities 
E4,<AE, +B, E =0 


n 
for some positive constants A and B. Then, E,, is less than or equal to y,, 
where 


Yn+1= Ay, + B, Yo=O. (2) 


Proor. We prove Lemma | by induction on n. To this end, observe that 
Lemma | 1s obviously true for n=0. Next, we assume that Lemma | 1s true 
for n=j. We must show that Lemma | is also true for n=j+1. That is to 
say, we must prove that E;< y, implies E,,,< y,,,. But this follows im- 
mediately, for if E,< y, then 


Ei4,<AE,+ B< Ay, + B=yj41. 


By induction, therefore, EF, < y,, n=0,1,...,N. CO 
Our next task is to solve Equation (2), which is often referred to as a dif- 
ference equation. We will accomplish this in two steps. First we will solve 

the “simple” difference equation 
Yn+1=Ynt+ By, Yo=o- (3) 


Then we will reduce the difference equation (2) to the difference equation 
(3) by a clever change of variables. 
Equation (3) is trivial to solve. Observe that 


Adding these equations gives 


(Yn —Yn—1) + (Vn-1 -Yn—2) Fe F(T Yo) = Bot Bit... + By}: 


Hence, 
n—-1 
Ve=Vot Bot... +B, ,=yot > B.. 
j=0 
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Next, we reduce the difference equation (2) to the simpler equation (3) 
in the following clever manner. Let 


Yn 


Z, = An”? n=0,1,...,N. 
Then, z,,,=y,4,;/A"*'. But y,, ,= Ay, + B. Consequently, 
Yn B B 
Znti= Gn nat 2 Tal” 
Therefore, 
1 n 
n-1 1-(+] 
a B _. LB A 
Z, = Zot a 41 =Yot 7 7 
- A 
+ 1-(1) 
%ov “4-1 A 
and 
n n B n 
Yn=A Z,=A Yot GA —1). (4) 
Finally, returning to the inequalities (1), we see that 
E,<z=7(A"=1), n=1,2,...,N. (5) 


While collecting material for this book, this author was approached by a 
colleague with the following problem. He had just received a bill from the 
bank for the first payment on his wife’s student loan. This loan was to be 
repaid in 10 years in 120 equal monthly installments. According to his 
rough estimate, the bank was overcharging him by at least 20%. Before 
confronting the bank’s officers, though, he wanted to compute exactly the 
monthly payments due on this loan. 

This problem can be put in the following more general framework. 
Suppose that P dollars are borrowed from a bank at an annual interest rate 
of R%. This loan is to be repaid in n years in equal monthly installments of 
x dollars. Find x. 

Our first step in solving this problem is to compute the interest due on 
the loan. To this end observe that the interest J, owed when the first pay- 
ment is due is J, =(r/12)P, where r= R/100. The principal outstanding 
during the second month of the loan is (x — J,) less than the principal out- 
standing during the first month. Hence, the interest J, owed during the sec- 
ond month of the loan is 


r 
Similarly, the interest J;, , owed during the (j+ 1)st month is 
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r r r 
Iar=h-Zle-h)=(14+ 5 )h- ps (6) 
where J; is the interest owed during the jth month. 
Equation (6) is a difference equation for the numbers 


r 


N= 


Py Io, .0-5L a9: 
Its solution (see Exercise 4) is 
_r r \J7! r \i7! 
L= P(1+ 55) tx} 1-(14 5) | 


Hence, the total amount of interest paid on the loan is 
12n 


T=1,+1,+...+1),= > I, 
j=] 
; 12n . - 12n . \irl 
= 77? (1475) + 12me—x 3 (1+ 7p) 
Now, 
Ss r I 12 r \'2n 
2 (1+ 33) => |(1+ 7) -1| 
Therefore, 
p=P|(1+9y) 1] +12mx- Bl (14 FY] 
=12me—P+P(I+75) -BA[(1+5) "1 


But, 12nx — P must equal J, since 12nx is the amount of money paid the 
bank and P was the principal loaned. Consequently, 


P(1+-f)"-2{(14 )"-1]-0 


and this equation implies that 


Pl! 5). 
i ar, (7) 
(i+) -1 


Epilog. Using Equation (7), this author computed x for his wife’s and 
his colleague’s wife’s student loans. In both cases the bank was right—to 
the penny. 
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EXERCISES 


1. 


Solve the difference equation y,,,= —7y, +2,yo= I. 


2. Find y37 if ¥n4,=3y,+ 1, Yo=9, n=0,1,...,36. 


10. 


. Estimate the numbers £o, £),...,£y if E>=0 and 


(a) E,4,<3E,+1, n=0,1,...,.N—1; 
(b) E,4,<2E, +2, n=0,1,...,.N—1. 


. (a) Show that the transformation y;= J;,, transforms the difference equation 


J 
r 


=P 


ban(It goa A 


into the difference equation 


r r r 
Yya1=(14 F)y- Ts Yo= 75? 


(b) Use Equation (4) to find y,_,= J. 


. Solve the difference equation y,,, =a,y, + 5,, y,; =a. Hint: Set z;=y, and 


Z,=),/a,..-d,—, for n>2. Observe that 


Q,.--Ay ; 


Hence, conclude that z,=z,+272{b;/a...a;. 


. Solve the difference equation y,,,—ny,=1—n, y,=2. 
° Find y>5 if y,=1 and (n+ Ll) V4) 7 Y, = 2"; n= 1,...,24. 


. A student borrows P dollars at an annual interest rate of R%. This loan is to be 


repayed in n years in equal monthly installments of x dollars. Find x if 
(a) P=4250, R=3, and n=5; 
(b) P=5000, R=7, and n= 10. 


. A home buyer takes out a $30,000 mortgage at an annual interest rate of 9%. 


This loan is to be repaid over 20 years in 240 equal monthly installments of x 
dollars. 

(a) Compute x. 

(b) Find x if the annual interest rate is 10%. 


The quantity supplied of some commodity in a given week is obviously an in- 
creasing function of its price the previous week, while the quantity demanded 
in a given week is a function of its current price. Let S; and D, denote, respec- 
tively, the quantities supplied and demanded in the jth week, and let P; denote 
the price of the commodity in the jth week. We assume that there exist positive 
constants a, b, and c such that 


S=aP,_, and D,=b-—cP,. 
(a) Show that P,=b/(a+c)+(—a/c)/(Py— b/(a+c)), if supply always 
equals demand. 
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(b) Show that P; approaches b/(a+c) as j approaches infinity if a/c <1. 

(c) Show that P=b/(a+c) represents an equilibrium situation. That is to say, 
if supply always equals demand, and if the price ever reaches the level 
b/(a+c), then it will always remain at that level. 


1.13 Numerical approximations; Euler’s method 


In Section 1.9 we showed that it is not possible, in general, to solve the ini- 
tial-value problem 


o =f(ty),  y(to)=Yo. (1) 


Therefore, in order that differential equations have any practical value for 
us, we must devise ways of obtaining accurate approximations of the solu- 
tion y(t) of (1). In Sections 1.13—1.16 we will derive algorithms, which can 
be implemented on a digital computer, for obtaining accurate approxima- 
tions of y(Z). 

Now, a computer obviously cannot approximate a function on an entire 
interval tg < t< tg9+ a since this would require an infinite amount of infor- 
mation. At best it can compute approximate values y,,...,yy of y(t) ata 
finite number of points ¢,,/,,...,¢y. However, this is sufficient for our pur- 
pose since we can use the numbers y,,...,y¥, to obtain an accurate ap- 
proximation of y(t) on the entire interval tg< t<t)+ a. To wit, let y(t) be 
the function whose graph on each interval [¢,¢,, ,] is the straight line con- 
necting the points (1,,y,) and (¢,, 1,¥;4,) (see Figure 1). We can express y(t) 
analytically by the equation 


, 1 
y (t)=y,+ ran ti)(Yj417Y;)> GStSGah 


Figure 1. Comparison of y(t) and y(t) 
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If y(t) is close to y(Z) at t=1,; that is, if y, is close to y(z), and if ¢,,, is 
close to 4, then y(t) is close to y(t) on the entire interval ¢;<¢<4,,. This 
follows immediately from the continuity of both y(Z) and y(t). Thus, we 
need only devise schemes for obtaining accurate approximations of y(t) at 
a discrete number of points ¢,,...,¢, in the interval tf) << ¢< tg+a. For sim- 
plicity, we will require that the points ¢,,...,¢, be equally spaced. This is 
achieved by choosing a large integer N and setting t, =1,+k(a/N), k= 
1,...,M. Alternately, we may write ¢,,,=t,+h where h=a/N. 

Now, the only thing we know about y(f) is that it satisfies a certain dif- 
ferential equation, and that its value at t= fp) is yo. We will use this infor- 
mation to compute an approximate value y, of y at t=t,=)+h. Then, we 
will use this approximate value y, to compute an approximate value y, of y 
at ¢=¢,=1t,+Ah, and so on. In order to accomplish this we must find a the- 
orem which enables us to compute the value of y at t=t, +h from the 
knowledge of y at t=4,. This theorem, of course, is Taylor’s Theorem, 
which states that 
Hn) , 12 Flt) 
y4t+h= v(t) the +e WP + 


(2) 


Thus, if we know the value of y and its derivatives at t=¢,, then we can 
compute the value of y at t=t,+h. Now, y(t) satisfies the initial-value 
problem (1). Hence, its derivative, when evaluated at ¢=7,, must equal 
f(t,,y¥(t,)). Moreover, by repeated use of the chain rule of partial dif- 
ferentiation (see Appendix A), we can evaluate 


“A -[Ser$)oo 


and all other higher-order derivatives of y(t) at t=t,. Hence, we can re- 
write (2) in the form 


Y(tye41) ae + iu) 


ay | 2 tix | Hort) T. (3) 


The simplest approximation of y(t,,,) is obtained by truncating the 
Taylor series (3) after the second term. This gives rise to the numerical 
scheme 


roe 


YVi=Yothf(toyvo), = Ya=Yithf(t1), 
and, in general, 
Ver r= Vet Mf (teVe), — Vo= Y(t) (4) 
Notice how we use the initial-value y, and the fact that y(f) satisfies the 
differential equation dy /dt=f(t,y) to compute an approximate value y, of 


y(t) at t=4,. Then, we use this approximate value y, to compute an ap- 
proximate value y, of y(t) at t=¢,, and so on. 
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Equation (4) is known as Euler’s scheme. It is the simplest numerical 
scheme for obtaining approximate values y,,...,¥, of the solution y(Z) at 
times ¢,,...,¢,. Of course, it is also the least accurate scheme, since we have 
only retained two terms in the Taylor series expansion for y(t). As we shall 
see shortly, Euler’s scheme is not accurate enough to use in many prob- 
lems. However, it is an excellent introduction to the more complicated 
schemes that will follow. 


Example 1. Let y(¢) be the solution of the initial-value problem 


dy/dt=1+(y-t), — y(0)=3. 


Use Euler’s scheme to compute approximate values y,,...,y, of y(f) at the 
points 4; =1/N,t,=2/N,...,ty=1. 
Solution. Euler’s scheme for this problem is 


Yer =Ythll+(y%-G) |,  k=0,1,..,.N-1, h=1/N 


with y, = 4. Sample Pascal and Fortran programs to compute y,,..., yy are 
given below. These programs, as well as all subsequent programs, have 
variable values for f, yp, a, and N, so that they may also be used to solve 
the more general initial-value problem dy/dt=1+(y— 0)’, y(to)=yo on 
any desired interval. Moreover, these same programs work even if we 
change the differential equation; if we change the function f(t,y) then we 
need only change line 12 in the Pascal program (and line 11 in the C program) 
and the expressions for Y(1) and Y(K) in Section B of the Fortran program. 


Pascal Program 
Program Euler (input, output); 


var 
T, Y: array[0..999] of real; 
a, h: real; 
k, N: integer; 


begin 
readin(T [0], Y(0], a, N); 
h:=a/N; 
page; 
for k:=0 to N—1 do 
begin 
T{kK+ 1] :=T[k] +h; 
Y[k + 1} :=Y[k] +h + (1+ (Y[k] — T[k]) * (Y[k] — T[k])); 
end; 
writeln(‘T’:4, ‘Y’:16); 
for k :=0Oto N do 
writeln(T[k]:10:7, ‘ °:2, Y[k]:16:9); 
end. 
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Fortran Program 


Section A DIMENSION T(1000), Y(1000) 
Read in data READ (5,10) TO, YO, A, N 
10 FORMAT (3F20.8, |5) 
H=A/N 
Section B T(1)=TO+H 
Do computations Y(1)=YO+H«(1+(YO— TO) *# *2) 
DO 20 K=2,N 


T(K)=T(K—1)+H 
Y(K) = Y(K— 1) +H*(1+(Y(K—1) 
1| —T(K—1))#* *2) 
20 CONTINUE 


Section C WRITE (6, 30) TO, YO, (T(J), Y(J), J=1,N) 
Print out 30 FORMAT (1H1, 3X, 1HT, 4X, 1HY, /(1H, 1X, 
results 1| F10.7, 2X, F20.9/)) 

CALL EXIT 

END 


See also C Program 4 in Appendix C for a sample C program. 
Table 1 below gives the results of these computations for a=1, N=10, 45= 
0, and y)= i All of these computations, and all subsequent computations, 
were carried out on an IBM 360 computer using 16 decimal places ac- 
curacy. The results have been rounded to 8 significant decimal places. 


Table 1 


0.5 ; 1.29810115 
0.625 . 1.44683567 


0.7525625 ; 1.60261202 
0.88309503 . 1.76703063 
1.01709501 1.94220484 
1.15517564 


The exact solution of this initial-value problem (see Exercise 7) is 


y(t)=t+1/(2—2). 
Thus, the error we make in approximating the value of the solution at t= 1 
by yj is approximately 0.06, since y(1)=2. If we run this program for N= 
20 and N=40, we obtain that yo) = 1.96852339 and y4)= 1.9835109. Hence, 
the error we make in approximating y(1) by yo is already less than 0.02. 
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EXERCISES 


Using Euler’s method with step size h=0.1, determine an approximate 
value of the solution at t=1 for each of the initial-value problems 1-5. Re- 
peat these computations with h=0.025 and compare the results with the 
given value of the solution. 


1 Zaiti-y, yO=0; W=9 


2, F ary, y()=2; ((=2€") 


3. Baity?—2, yO=0; (N= 


4. 5 ate? + 5, 1O)=0; (x(N=In1 +) 


2 


dy y 2 
eo = —14+2t+ , 0)=1; ti=1+t 
Zi game POTKOOH149 


6. Using Euler’s method with h=7/40, determine an approximate value of the 
solution of the initial-value problem 
@ = 2sec?t—(1+y?), y(0)=0 


at t=7/4. Repeat these computations with h=7/160 and compare the results 
with the number one which is the value of the solution y(t)=tant at t=7/4. 


7. (a) Show that the substitution y = 1+ z reduces the initial-value problem y’ = 1+ 
(y — 2)’, y(0)=0.5 to the simpler initial-value problem z’ =z, z(0)=0.5. 
(b) Show that z(t)=1/(2—2). Hence, y(t)=t+1/(2—2). 


1.13.1 Error analysis for Euler’s method 


One of the nice features of Euler’s method is that it is relatively simple to 
estimate the error we make in approximating y(t,) by y,. Unfortunately, 
though, we must make the severe restriction that f,,...,t, do not exceed f, 
+ a, where a is the number defined in the existence—uniqueness theorem of 
Section 1.10. More precisely, let a and b be two positive numbers and 
assume that the functions f, df/d¢, and of/dy are defined and continuous 
in the rectangle tg < t< to +a, yyp—b< y< yotb. We will denote this rect- 
angle by R. Let M be the maximum value of | f(t,y)| for (t,y) in R, and set 
a=min(a,b/M). We will determine the error committed in approximating 
y(t,) by y,, for t, < tot a. 

To this end observe that the numbers yo,y,,...,yy Satisfy the difference 
equation 


VerraVet hf (ter) k=0,1,....N—-1 (1) 
while the numbers y(‘p),y(¢,),---,¥(ty) Satisfy the difference equation 
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a of 
of ty | Ger Ge) (2) 


where is some number between ¢, and ¢,,,. Equation (2) follows from 
the identity 


V (tee DEV (4) + (49 (4) +4 > 


d*y _ of of 
dt? ot * ay 
and the fact that ; 
on _e d*y(r) 


y(tth)=y(t)t+h 


vy dt? 
for some number + between ¢ and ¢+h. Subtracting Equation (1) from 
Equation (2) gives 

Y(t) Vea =Y (4K) Vet er) —f (ter) | 


ay Fy [Ge (&, ))- 


+e 


Next, observe that 
F (tesY (te) —F tee) = L Y(t) He | 


where 7, is some number between y(t,) and y,. Consequently, 


Of (tes) 
dy 


Of (tes Nk) 
dy 


IY (tear Yeas S| (4) —y,|+h | (te) —Yx| 


h2\{ of 
TH ar 


0 
+e | (&; )) . 


In order to proceed further, we must obtain estimates of the quantities 
(Of (t,,n,))/ oy and [(df/ dt) + f(df/ dy)|(§,,y (§)). To this end observe that 
the points (&,,y(&,)) and (¢,,y;,) all lie in the rectangle R. (It was shown in 
Section 1.10 that the points (€,,y(&,)) lie in R. In addition, a simple induc- 
tion argument (see Exercise 9) shows that the points (¢,,y,) all lie in R.) 
Consequently, the points (t,,7,) must also lie in R. Let L and D be two 
positive numbers such that 


of 
yy < 
of 


max |— 
(t,y) in R yt dy 


Such numbers always exist if f, df/dt, and df/dy are continuous in R. 
Then, 


max 
(t,y)in R 

and 

< D. 


Dh? 
IY Cent) Veil $1Y (4) Yel + ALLY (4) — Yel + Z- (3) 
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Now, set FE, =|y(t,)—y,|, k=0,1,...,N. The number £, is the error we 
make at the kth step in approximating y(t,) by y,. From (3) 


Dh? 
Ey41 S(LFAL)E,+ =, k=0,1,...N-1, (4) 


Moreover, E,=0 since y(tp)=yo. Thus, the numbers Ep, E,,...,Ey satisfy 
the set of inequalities 
FE, 4, <AE, + B, E,)=0 


with A=1+hAL and B= Dh*/2. Consequently, (see Section 1.12) 
B 
A-1 

We can also obtain an estimate for £, that is independent of k. Observe 
that 1+hL < e*”. This follows from the fact that 


E, <q (A*=1)= 24 [ (1+ AL) - 1]. (5) 


(AL) | (ALY 
Tay 
=(1+AL)+ “something positive’. 


eth =14+hAL+ 


Therefore, 


Dh k Dh 
E, <5, | (e”) -1]=57[e@*—1]. 


Finally, since kh < a, we see that 
Dh « 
E<xy lew], kad. N. (6) 
Equation (6) says that the error we make in approximating the solution 
y(t) at ime t=4, by y, is at most a fixed constant times h. This suggests, as 
a rule of thumb, that our error should decrease by approximately ; if we 
decrease h by 3. We can verify this directly in Example 1 of the previous 


section where our error at t=1 for h=0.1, 0.05, and 0.025 is 0.058, 0.032, 
and 0.017 respectively. 


Example 1. Let y(t) be the solution of the initial-value problem 


dy t*+y? 
a ) 3 y(0)=0. 


(a) Show that y(¢) exists at least for 0<¢< 1, and that in this interval, —1 
< y(O<l. 

(b) Let N be a large positive integer. Set up Euler’s scheme to find ap- 
proximate values of y at the points t, =k/N, k=0,1,...,N. 

(c) Determine a step size h=1/WN so that the error we make in approxi- 
mating y(¢,) by y, does not exceed 0.0001. 

Solution. (a) Let R be the rectangle O<¢<1, —1<y<1. The maximum 
value that (¢?+,y7)/2 achieves for (t,y) in R is 1. Hence, by the exist- 
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ence—uniqueness theorem of Section 1.10, y(t) exists at least for 
O0<t<a= min(1, t)=1, 


and in this interval, -l<y<l. 


te t+ YE 1[(kY 
(b) res ret h( 9S*) art oly (=) + yp 


with y»=0. The integer k runs from 0 to N—1. 
(c) Let f(t,y)=(t7+y’)/2, and compute 
Of J of 24 y2 
y and a tf5 yet a(t +y*). 
From (6), |y(t,)—y,|< (Dh bye! — 1) where LZ and D are two positive 


numbers such that 


max |y|< 
(t,y) in R 


and 


max [f+ —(t*+y2)|<D. 
(t,y) in R | ») 


Now, the maximum values of the functions | y| and |f+(y/2)(t?+,y2)| for 
(t,y) in R are clearly 1 and 2 respectively. Hence, 
2h 
(i) — el < (e- 1) =h(e- 1). 


This implies that the step size h should be smaller than 0.0001 /(e— 1). 
Equivalently, N should be larger than (e—1)10*=17,183. Thus, we must 


iterate the equation 
2 


_ l k 2 
Yer1=Ye* 277 183) [ 7.183} Yk 


17,183 times to be sure that y(1) is correct to four decimal places. 


Example 2. Let y(t) be the solution of the initial-value problem 


dy 
dt 
(a) Show that y(¢) exists at least for 0<¢< 1, and that in this interval, —1 
<y <3. 
(b) Let N be a large positive integer. Set up Euler’s scheme to find ap- 
proximate values of y(f) at the points 1 =k/N, k=0,1,...,N 
(c) Determine a step size h so that the error we make in approximating 
y(t,) by y, does not exceed 0.0001. 
Solution. 
(a) Let R be the rectangle 0< ¢< 1, |y—1|<2. The maximum value that 7? 
+e” achieves for (7,y) in R is 2. Hence, y(¢) exists at least for O<t< 
min(1,2/2)=1, and in this interval, —1< y <3. 


=f+te”, y(O)=1. 
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(b) Year =e FAG? +e )=y, + (1/N(K/ NYP +e] with yo= 1. The 
integer k runs from 0 to N—1. 
(c) Let f(t,y)=12+e-” and compute 
of of of 
—_— = —y — — 
ay 2ye *, and ry +45, 
From (6), | y(t,)—y,|<(Dh/2L)(e*—1) where L and D are two positive 
numbers such that 


=2t—2y(?+e-” em”. 


max |—2ye”"|<L 
(t,y) in R 


and 
max |2¢—2y(t7+ e” je~»”"| < D. 
(4,y) in R 
Now, it is easily seen that the maximum value of |2ye -¥"| for -l< y<3 1s 


V2 /e . Thus, we take L= V2/e . Unfortunately, though, it is extremely 
difficult to compute the maximum value of the function 


l2t—2y(t? +e” Je” 


for (t,y) in R. However, we can still find an acceptable value D by observ- 
ing that for (t,y) in R, 


max|2t—2v(t?+e~” )e~”| < max|2¢| + max|2y (1? +e” )e~”| 


< max|2¢|+max|2ye-”’| x max(t?+ e~”) 
=2+2V2/e =2(1+ V2/e ). 
Hence, we may choose D=2(1+ V2/e ). Consequently, 
2(1+ V2/e Jal eV?/* -1] 
2V2/e | 


This implies that the step size h must be smaller than 
V2/e 0.0001 
1+ V2/e  eV2/e -] 


Examples 1 and 2 show that Euler’s method is not very accurate since 
approximately 20,000 iterations are required to achieve an accuracy of four 
decimal places. One obvious disadvantage of a scheme which requires so 
many iterations is the cost. The going rate for computer usage at present is 
about $1200.00 per hour. A second, and much more serious disadvantage, 
is that y, may be very far away from y(t,) if N is exceptionally large. To 
wit, a digital computer can never perform a computation exactly since it 
only retains a finite number of decimal places. Consequently, every time 
we perform an arithmetic operation on the computer, we must introduce a 


1Y (t,) — Yel < 
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“round off” error. This error, of course, is small. However, if we perform 
too many operations then the accumulated round off error may become so 
large as to make our results meaningless. Exercise 8 gives an illustration of 
this for Euler’s method. 


EXERCISES 


1. Determine an upper bound on the error we make in using Euler’s method with 
step size h to find an approximate value of the solution of the initial-value prob- 
lem 


a2? XO)! 
at any point ¢ in the interval [0, 2]. Hint: Let R be the rectangleO<1<10<y< 
2. 


2. Determine an upper bound on the error we make in using Euler’s method with 
step size A to find an approximate value of the solution of the initial-value prob- 
lem 


dy _._y4 _ 
a! a y(0)=0 


at any point ¢ in the interval [0, 1]. Hint: Let R be the rectangleO<t<1, -—l<y 
<1. 


3. Determine an upper bound on the error we make in using Euler’s method with 
step size h to find an approximate value of the solution of the initial-value prob- 
lem 


—=tt+e”, y(0)=0 


at any point ¢ in the interval [0,1 /(e+ 1)]. Hint: Let R be the rectangle 0 <i¢< 1, 
-l<y<l. 


4. Determine a suitable value of h so that the error we make in using Euler’s 
method with step size h to find an approximate value of the solution of the ini- 
tial-value problem 


dy 
Hae ys ~—-¥(0)=0 


at any point ¢ in the interval [0,1/e] is at most 0.0001. Hint: Let R be the rect- 
angle O<t<l1, -—Il<y<l. 


5. Determine a suitable value of h so that the error we make in using Euler’s 
method with step size h to find an approximate value of the solution of the ini- 
tial-value problem 


@ =P +tan’y, y(0)=0 


at any point ¢ in the interval [0, $] is at most 0.00001. Hint: Let R be the rectan- 
gleO<t<}, -a7/4<y<a/4. 
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6. Determine a suitable value of h so that the error we make in using Euler’s 
method with step size h to find an approximate value of the solution of the ini- 
tial-value problem 


dy 1 


di japey? 720° 


at any point ¢ in the interval [0, 1] is at most 0.0001. Hint: Let R be the rectangle 
O0<rt<l, -I<y<l. 


7, Let y(t) be the solution of the initial-value problem 


dy _ 
Suppose that | f(t,y)| <1, |df/dy| <1, and |(af/dt)+f(df/dy)| <2 in the rectan- 
gle 0<t<1, —1<_y<1. When the Euler scheme 
] 
Ves vet hfe yy), = N 
is used with N=10, the value of y, is —0.15{(2)°—1], and the value of y¢ is 
0.12{(44)°— 1]. Prove that y(t) is zero at least once in the interval (+, 2). 


8. Let y(t) be the solution of the initial-value problem 


y=f(ty), (to) =Yo. 
Euler’s method for finding approximate values of y(t) is y.41=y, + Af (ts Yx): 
However, the quantity y, +Af(t,,y,) is never computed exactly: we always in- 
troduce an error ¢, with |¢,|<«. That is to say, the computer computes numbers 
V1¥2-.., Such that 

Der = Vit Mf (tsIe) + & 


with Yo=yo. Suppose that |df/dy| < L and |(df/9dr)+ f(af/dy)| < D for all ¢ and 


y. 
(a) Show that 


~ D 
Exe =|) (e+) Frail <A + ALE, + ahite 


(b) Conclude from (a) that 


for kh<a. 
(c) Choose h so that the error E, is minimized. Notice that the error E, may be 
very large if h is very small. 


9, Let y),y5,... Satisfy the recursion relation 


Yar =Ver hf (teVx)- 
Let R be the rectangle tg << ¢< tg + a,yyp—b< y < Yot 5, and assume that | f(t, y)| 
< M for (t,y) in R. Finally, let a=min(a,b/M). 
(a) Prove that |y;—yo| < jhM, as long as jh < a. Hint: Use induction. 
(b) Conclude from (a) that the points (¢,,y,) all lie in R as long as j <a/h. 
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1.14 The three term Taylor series method 


Euler’s method was derived by truncating the Taylor series 
Y(t 41) io + er GO) 


oy 155 [lux (id) + (1) 


after the second term. The most obvious way of obtaining better numerical 
schemes is to retain more terms in Equation (1). If we truncate this Taylor 
series after three terms then we obtain the numerical scheme 


of 


Yee et AL (yd) + at Fy |e) k=0,....N—1 (2) 


with yo=y (to). 

Equation (2) is called the three term Taylor series method. It is obviously 
more accurate than Euler’s method. Hence, for fixed h, we would expect 
that the numbers y, generated by Equation (2) are better approximations 
of y(t,) than the numbers y, generated by Euler’s scheme. This is indeed 
the case, for it can be shown that | y(t,)—y,| is proportional to h* whereas 
the error we make using Euler’s method is only proportional to h. The 
quantity h? is much less than h if h is very small. Thus, the three term 
Taylor series method is a significant improvement over Euler’s method. 


Example 1. Let y(¢) be the solution of the initial-value problem 
dy _ 1 


Use the three term Taylor series method to compute approximate values of 
y(2) at the points 1, =k/N, k=1,...,N. 


Solution. Let f(t,y)=1+(y— 2). Then, 


Lj $= =A y=1)4Ay-O[14(y~-0*]=20- 9" 


Hence, the three term Taylor series scheme is 


Vet 1 Vet h| 1+ (%- th) | +? (yt)? 


with h = 1/N and y, = 5. The integer k runs from 0 to N — 1. Sample Pascal 
and Fortran programs to compute y,,...,y,y are given below. Again, these 
programs have variable values for fp, yo, a, and N. 
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Pascal Program 


Program Taylor (input, output); 


var 
T, Y: array[0..999] of real; 
a, h, Temp: real; 
k, N: integer; 


begin 
readin(T [0], Y[0], a, N); 
h:=a/N; 
page; 
for k:=0 to N—1 do 
begin 
Temp := Y (k] — T[k]; 
T{k+ 1] :=T[k] +h; 
Y(k+ 1] := Y[k] + h* (1+ Temp + Temp) 
+h+h*Temp* Temp+« Temp; 
end; 
writeln(‘T’:4, ‘Y’:16); 
for k :=0 to N do 
writeln(T [k]:10:7, ‘ °:2, Y[k]:16:9); 
end. 


Fortran Program 


Replace Section B of the Fortran program in Section 1.13 by the follow- 
ing: 


T(1)=TO+H 

D2Y =H*(YO—TO0)+# «3 

Y(1)=YO+H+#(D2Y +1+(YO—TO)# *2) 

D020 K=2,N 

T(K)=T(K—1)+H 

D2Y =H+(Y(K—1)—T(K—1))* #3 

Y(K) = Y(K—1) +H*(D2Y + 1+(Y¥(K—1)—T(K—1))* #2) 
20 CONTINUE 


See also C Program 5 in Appendix C for a sample C program. 
Table 1 below shows the results of these computations for a=1, N= 10, 
ty>=0, and y)>=0.5. 


Now Euler’s method with N=10 predicted a value of 1.9422 for y(1). No- 
tice how much closer the number 1.9957 is to the correct value 2. If we run 
this program for N=20 and N=40, we obtain that y,.=1.99884247 and 
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Table 1 


y 
1.31331931 


. . 1.46783 13 
0.7554013 . 1.63131465 
0.88796161 . 1.80616814 
1.02456407 1.995723 13 
1.1660084 


Y4g = 1.99969915. These numbers are also much more accurate than the val- 
ues 1.96852339 and 1.9835109 predicted by Euler’s method. 


EXERCISES 


Using the three term Taylor series method with h=0.1, determine an ap- 
proximate value of the solution at t=1 for each of the initial-value prob- 
lems 1-5. Repeat these computations with h=0.025 and compare the re- 
sults with the given value of the solution. 


. dy /dt=1+t-y, yO=0; (VXO=9 

. dy/dt=2ty, yO)=2; (y()=2e") 

. dy /dt=1+y?—0, yO=0; (y(N=4) 

. dy/dt=te"+t/(1+27), yO=0; (y()=In(11+2’)) 
. dy/dt=—-1+2t+y*/+r?y, yO=H1, YYOD=14+2) 


. Using the three term Taylor series method with h=7/40, determine an ap- 
proximate value of the solution of the initial-value problem 


nO wa & |] HN me 


dy 
= 2sect—-(1+y’), y(0)=0 


at t=7/4. Repeat these computations with h= 7/160 and compare the results 
with the number one which is the value of the solution y(#)=tant at t=7/4. 


1.15 An improved Euler method 


The three term Taylor series method is a significant improvement over 
Euler’s method. However, it has the serious disadvantage of requiring us to 
compute partial derivatives of f(t,y), and this can be quite difficult if the 
function f(t,y) is fairly complicated. For this reason we would like to de- 
rive numerical schemes which do not require us to compute partial deriva- 
tives of f(t,y). One approach to this problem is to integrate both sides of 
the differential equation y’=f(t,y) between ¢, and ¢,+h to obtain that 


Y (tes) = (te) + *" (ty) dt. (1) 
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This reduces the problem of finding an approximate value of y(t, ,) to the 
much simpler problem of approximating the area under the curve f(t,y(Z)) 
between ¢, and ¢,+h. A crude approximation of this area is hf(t,,y(t,)), 
which is the area of the rectangle R in Figure la. This gives rise to the 
numerical scheme 
Vir =Vet Mf (GeV) 

which, of course, is Euler’s method. 

A much better approximation of this area is 


lf (t,.9(t,)) +f(ths (ts ))] 


which is the area of the trapezoid T in Figure 1b. This gives rise to the 
numerical scheme 


Yer et S [Stored +S (ter wee): (2) 


However, we cannot use this scheme to determine y,,, from y, since y, 4, 
also appears on the right-hand side of (2). A very clever way of overcoming 
this difficulty is to replace y,,, in the right-hand side of (2) by the value 
y, + hf (t,,¥,) predicted for it by Euler’s method. This gives rise to the 
numerical scheme 


Ver iV tT $ [ fF ( tes Ye) + f(t, thy, hf (tex) |; Yo=V(to)- (3) 


Equation (3) is known as the improved Euler method. It can be shown 
that |y(¢,)—y,| is at most a fixed constant times h?. Hence, the improved 
Euler method gives us the same accuracy as the three term Taylor series 
method without requiring us to compute partial derivatives. 


f(t,y(t)) ~\ 


Figure | 


Example 1. Let y(t) be the solution of the initial-value problem 
dy 2 ] 
gatQ-y, yrO=>5- 
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Use the improved Euler method to compute approximate values of y(t) at 
the points t, =k/N, k=1,...,N. 
Solution. The improved Euler scheme for this problem is 


h 2 
Yer =Vet 5 {1+ Ot) 1+ [yet AC + (re WY) hee] 


with h = 1/N and yo = 0.5. The integer k runs from 0 to N — 1. Sample Pascal 
and Fortran programs to compute y,,...,y, are given below. Again, these 
programs have variable values for fo, yo, a, and N. 


Pascal Program 
Program Improved (input, output); 


var 
T, Y: array[0..999] of real; 
a, h, R: real; 
k, N: integer; 


begin 
readin(T [0], Y{O], a, N); 
h:=a/N; 
page, 
for k:=0 to N—1 do 
begin 
R:= 1+ (Y¥[k] — T[k]) * (¥[k] — T[k)]); 
T[k+ 1] :=T[k] +h; 
Y¥(k+ 1] :=Y[k] + (h/2) *(R+1 
+(Y[k] +h+*R—T[k+ 1]) « (Y[k] +h*R—T[k+ 1])); 
end; 
writeln(‘T’:4, ‘Y’:16); 
for k :=0 to N do 
writeln(T[k]:10:7, ‘ °:2, Y[k]:16:9); 
end. 


Fortran Program 


Replace Section B of the Fortran program in Example 1 of Section 1.13 by 
the following: 


T(1)=TO+H 

R=1+(YO—TO)« #2 

¥(1)=YO+(H/2)*(R+1+(Y0+(H*#R)—T(1))# #2) 

D020 K=2,N 

T(K)=T(K—1)+H 

R=1+(Y¥(K—1)—T(K—1))# «2 

Y¥(K) = Y(K— 1) +(H/2)*(R+1+(¥(K—1)+(H*R)—T(K)) « #2) 
20 CONTINUE 
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See also C Program 6 in Appendix C for a sample C program. 

Table 1 below shows the results of these computations for a=1, N=10, 
tp =0, and y)=0.5. If we run this program for N =20 and N =40 we obtain 
that yo9= 1.99939944 and yy) = 1.99984675. Hence the values yj, y59, and 
Yao computed by the improved Euler method are even closer to the correct 
value 2 than the corresponding values 1.99572313, 1.99884246, and 


1.99969915 computed by the three term Taylor series method. 
Table 1 
J y 
0.5 ; 1.31377361 


0.62628125 . 1.46848715 
0.75547445 . 1.63225727 


0.88809117 . 1.80752701 
1.02477002 1.99770114 
1.16631867 


EXERCISES 


Using the improved Euler method with h=0.1, determine an approximate 
value of the solution at t=1 for each of the initial-value problems 1-5. Re- 
peat these computations with h=0.025 and compare the results with the 
given value of the solution. 

1. dy/dt=1+t-y, yO=0; (y(N=d 

2. dy/dt=2y, yO)=2; (y()=2e") 

3. dy/dt=1+y?—0t?, yO=0; (W(O=) 

4. dy/dt=te-”+t/(l+’), yO)=0; (y(D)=In(1 +7?) 

5. dy/dt=—-1+2t+y*/(l+?y, yOH=H1; (y(Q=1+?) 

6. Using the improved Euler method with h=7/40, determine an approximate 


value of the solution of the initial-value problem 


dy 


7 =2sec*t—(1+y7), y(0)=0 


at t=7/4. Repeat these computations with h=2/160 and compare the results 
with the number one which is the value of the solution y(t)=tan¢ at t=7/4. 


1.16 The Runge-Kutta method 


We now present, without proof, a very powerful scheme which was devel- 
oped around 1900 by the mathematicians Runge and Kutta. Because of its 
simplicity and great accuracy, the Runge-Kutta method is still one of the 
most widely used numerical schemes for solving differential equations. It is 
defined by the equation 


112 


116 The Runge-Kutta method 


h 
Verity et 6 [ Ly t2Ly 2+ 2Ly3+ Ly 4]; k=0, 1,...,N-1 
where yp>= y(t.) and 


Ly =F (te Vx) Ly =f (4, +3hy,+ shL,, i) 
Lys=Sf (t+ ght fhly2)s Leaf (thy, thls). 


This formula involves a weighted average of values of f(t,y) taken at diffe- 
rent points. Hence the sum 2[L, ,+2L,,+2L,3,+ L;, 4] can be interpreted 
as an average slope It can be shown that the error | y(t,)—y,| is at most a 
fixed constant times h*. Thus, the Runge-Kutta method is much more ac- 
curate than Euler’s method, the three term Taylor series method and the 
improved Euler method. 


Example 1. Let y(7) be the solution of the initial-value problem 


dy 2 1 
Salty y=}, 


Use the Runge-Kutta method to find approximate values y,,...,yy of y at 
the points  =k/N, k=1,...,N. 

Solution. Sample Pascal and Fortran programs to compute y,,..., yy by the 
Runge-Kutta method are given below. These programs differ from our 
previous programs in that they do not compute y, separately. Rather, they 
compute y, in the same “loop” as they compute y,,...,y,. This is accom- 
plished by relabeling the numbers f, and yy as ¢t, and y, respectively. 


Pascal Program 


Program Runge-Kutta (input, output); 


var 
T, Y: array[0..999] of real; 
a, h, LK1, LK2, LK3, LK4: real; 
k, N: integer; 


begin 
readin(T[0], Y[0], a, N); 
h:=a/N; 
page, 
for k:=0 to N—1 do 
begin 
T[k+ 1] :=T[k] +h; 
LK1 :=1+(Y[k] — T[k]) « (Y[k] — T[k]); 
LK2:=1+ ((Y¥ [k] + (h/2) * LK1) — (T[k] + h/2)) 
* ((Y [k] + (h/2) * LK1) — (T[k] + h/2)); 


113 


1 First-order differential equations 


LK3 :=1+ ((Y[k] + (h/2) * LK2) — (T[k] + h/2)) 
* ((Y [k] + (h/2) * LK2) — (T[k] + h/2)); 
LK4:=1+((Y[k] +h *LK3) —(T[k] +h)) 
* ((Y[k] +h *LK3)— (T[k] +h)); 
Y(k+ 1] := Y[k] + (h/6) « (LK1+LK4+ 2+ (LK2+LK9)); 
end; 
writeln(‘T’:4, “Y’:16); 
for k :=0 to N do 
writeln(T (k]:10:7, ‘ ':2, Y[k]:16:9); 
end. 


Fortran Program 


DIMENSION T(1000), Y(1000) 
READ (5,10) T(1), Y(1),A,N 
10 FORMAT (3F20.8, !5) 
H=A/N 
DO20 K=1,N 
T(K+1)=T(K)+H 
REAL LK1, LK2, LK3, LK4 
LK1 =1+(Y(K)—T(K)) « #2 
LK2=1+((Y(K)+(H/2)*LK1)—(T(K) +H/2))* #2 
LK3=1+((Y(K)+(H/2)*LK2)—(T(K)+H/2)) * #2 
LK4=1+((Y(K) +H*LK3)—(T(K)+H))« #2 
¥(K +1) = Y(K)+(H/6)*(LK1 +LK4 +2 *(LK2+LK3)) 
20 CONTINUE 
NA=N+1 
WRITE (6,30) (T(J), Y(J), J=1, NA) 
30 FORMAT (1H1, 3X, 1HT, 4X, 1HY, /(1H, 1X, F10.7, 2X, F20.9/)) 
CALL EXIT 
END 


See also C Program 7 in Appendix C for a sample C program. 


Table 1 below shows the results of these computations for a=1, N= 10, 
tj=0, and y,=0.5. 


Table 1 


y y 
0.5 ; 1.31428555 
0.62631578 ; 1.4692305 
0.75555536 ; 1.6333329 


0.88823526 . 1.8090902 
1.02499993 1.9999988 
1.16666656 
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Notice how much closer the number y,,. = 1.9999988 computed by the 
Runge-Kutta method is to the correct value 2 than the numbers yj) = 
1.94220484, y,)=1.99572312, and y,)= 1.99770114 computed by the Euler, 
three term Taylor series and improved Euler methods, respectively. If we 
run this program for N=20 and N=40, we obtain that y,.= 1.99999992 
and y4)=2. Thus, our approximation of y(1) is already correct to eight dec- 
imal places when h=0.025. Equivalently, we need only choose N 2 40 to 
achieve eight decimal places accuracy. 

To put the accuracy of the various schemes into proper perspective, let 
us say that we have three different schemes for numerically solving the ini- 
tial-value problem dy /dt=f(t,y), y(0)=0 on the interval 0< ¢ < 1, and that 
the error we make in using these schemes is 3h, 11h, and 42h‘, respec- 
tively. If our problem is such that we require eight decimal places ac- 
curacy, then the step sizes h,, h,, and h, of these three schemes must satisfy 
the inequalities 3h, < 107°, 11h3 < 107°, and 42h} < 107°. Hence, the num- 
ber of iterations N,, N,, and N, of these three schemes must satisfy the in- 
equalities 


N, 23 10° =300,000,000, N,>V11 x 10*%34,000 


and 1/4 
N, >(42)'/4 x 102260. 


This is a striking example of the difference between the Runge-Kutta 
method and the Euler, improved Euler and three term Taylor series 
methods. 


Remark. It should be noted that we perform four functional evaluations at 
each step in the Runge-Kutta method, whereas we only perform one func- 
tional evaluation at each step in Euler’s method. Nevertheless, the 
Runge-Kutta method still beats the heck out of Euler’s method, the three 
term Taylor series method, and the improved Euler method. 


EXERCISES 


Using the Runge-Kutta method with h=0.1, determine an approximate 
value of the solution at t=1 for each of the initial-value problems 1-5. Re- 
peat these computations with h=0.025 and compare the results with the 
given value of the solution. 


1. dy/dt=1+t-y, yO)=0; (y()=d) 
2. dy/dt=2y, yO=2; (y(t)=2e") 
3. dy/dt=1+y?-77, yO=0; (V(D=d 
4. dy/dt=te’+t/(1+?2), yO)=0; ((d=n(1+2) 
5. dy/dt=—14+2t+y?/(1+2)7), yOH=H1L; (VD=14+2) 
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6. Using the Runge-Kutta method with h= 7/40, determine an approximate value 
of the solution of the initial-value problem 


O =2sec2t—(1+y?), y(0)=0 
at t=7/4. Repeat these computations with h=7/160 and compare the results 
with the number one which is the value of the solution y(t)=tant at t=7/4. 


1.17 What to do in practice 


In this section we discuss some of the practical problems which arise when 
we attempt to solve differential equations on the computer. First, and fore- 
most, is the problem of estimating the error that we make. It is not too dif- 
ficult to show that the error we make using Euler’s method, the three term 
Taylor series method, the improved Euler method and the Runge-Kutta 
method with step size h is at most c,h, c,h?, c,h’, and c,h*, respectively. 
With one exception, though, it is practically impossible to find the con- 
stants C,, C>, C3, and c,. The one exception is Euler’s method where we can 
explicitly estimate (see Section 1.13.1) the error we make 1n approximating 
y(t,) by y,. However, this estimate is not very useful, since it is only valid 
for ¢, sufficiently close to ¢), and we are usually interested in the values of 
y at times ¢ much larger than f). Thus, we usually do not know, a priori, 
how small to choose the step size h so as to achieve a desired accuracy. We 
only know that the approximate values y, that we compute get closer and 
closer to y(t,) as h gets smaller and smaller. 

One way of resolving this difficulty is as follows. Using one of the 
schemes presented in the previous section, we choose a step size h and 
compute numbers y,,...,¥y. We then repeat the computations with a step 
size h/2 and compare the results. If the changes are greater than we are 
willing to accept, then it is necessary to use a smaller step size. We keep 
repeating this process until we achieve a desired accuracy. For example, 
suppose that we require the solution of the initial-value problem y’ = f(t,y), 
y(0)=y_ at t=1 to four decimal places accuracy. We choose a step size h 
= 1/100, say, and compute y,,...,¥199. We then repeat these computations 
with h=1/200 and obtain new approximations Z,,...,Z99- If Y¥i99 and Z99 
agree in their first four decimal places then we take Zz.) as our approxima- 
tion of y(1).* If ¥499 and 299 do not agree in their first four decimal places, 
then we repeat our computations with step size h= 1/400. 


Example 1. Find the solution of the initial-value problem 


“This does not guarantee that z9) agrees with y(1) to four decimal places. As an added pre- 
caution, we might halve the step size again. If the first four decimal places still remain un- 
changed, then we can be reasonably certain that z59) agrees with y(1) to four decimal places. 
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d 
—=y(Ite)+e',  y(0)=0 
at t=1 to four decimal places accuracy. 

Solution. We illustrate how to try and solve this problem using Euler’s 
method, the three term Taylor series method, the improved Euler method, 
and the Runge-Kutta method. 


(1) Euler’s method: 
Pascal Program 


Program Euler (input, output); 


var 
T, Y: array[0..999] of real; 
a, h: real; 
k, N: integer; 


begin 
readin(T [0], Y [0], a, N); 
h:=a/N; 
page, 
for k:=0 to N—1 do 
begin 
T(k+ 1] :=T[k] +h; 
Y[k +1] :=Y[k] +h « (Y[k] «(1 + exp(— Y[k])) + exp(T[k])); 
end; 
writeln(‘N’:4, ‘h’:10, ‘Y[N]’:20); 
writeln(N:4, ‘ °:2,:10:7,° °:2, ¥Y(INJ:18:10); 


end. 

Fortran Program 

DIMENSION 1T(1000), Y(1000) 
Section A READ (5, 10) T(1), Y(1),A,N, 
Read in data 10 FORMAT (3F20.8, I5) 

H=A/N 

D020 K=1,N 
Section B T(K+1)=T(K)+H 
Do computations Y(K+1)= Y(K)+H *(Y(K) *(1 + EXP(— Y(K))) 

1 + EXP(T(K))) 
20 CONTINUE 

WRITE (6, 30) N,H, Y(N +1) 
Section C 30 FORMAT (1H, 1X, 15, 2X, F10.7, 4X, F20.9) 
Print out CALL EXIT 
results END 
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See also C Program 8 in Appendix C for a sample C program. 
We set A=1, T[0]=0, Y[O] =0(T(1)= Y(1)=0 in the Fortran program) and 
ran these programs for N = 10, 20, 40, 80, 160, 320, and 640. The results of 
these computations are given in Table 1. Notice that even with a step size h 


Table 1 


0.1 2.76183 168 
0.05 2.93832741 
0.025 3.03202759 


0.0125 3.08034440 
0.00625 3.10488352 
0.003125 3.11725009 
0.0015625 3.12345786 


as small as 1/640, we can only guarantee an accuracy of one decimal 
place. This points out the limitation of Euler’s method. Since N is so large 
already, it is wiser to use a more accurate scheme than to keep choosing 
smaller and smaller step sizes h for Euler’s method. 


(11) The three term Taylor series method 


Pascal Program 


Program Taylor (input, output); 


var 
T, Y: array[0..999] of real; 
a, h, DY1, DY2: real; 
k, N: integer; 


begin 
readin(T [0], Y (OJ, a, N); 
h:=a/N; 
page, 
for k:=0 to N—1 do 
begin 
T[(k+ 1] :=T[k] +h; 
DY1 :=1+(1—Y{[k]) * exp(— Y[k]); 
DY2 :=Y [k] * (1+ exp(— Y[k])) + exp(T[k]); 
Y(k+ 1] := Y[k] +h « DY2 + (h + h/2) « (exp(T[k]) + DY1 « DY2); 
end; 
writeln(‘N’:4, ‘h’:10, “Y[N]’:20); 
writeln(N:4,° °:2,h:10:7,° 7:2, Y[N]:18:10); 
end. 
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Fortran Program 


Replace Section B of the previous Fortran program by 


DO20 K=1,N 

T(K+1)=T(K)+H 

DY1=1+(1—Y(K))*EXP(— Y(K)) 

DY2 = Y(K) «(1+ EXP(— Y(K))) + EXP(T(K)) 

Y(K+1)=Y(K)+H*DY2+(H*H /2) *(EXP(T(K)) + DY1 + DY2) 
20 CONTINUE 


See also C Program 9 in Appendix C for a sample C program. 
We set A=1, T[0]=0, and Y[0]=0 (T(1)=0, and Y(1)=0 in the Fortran 
program) and ran these programs for N= 10, 20, 40, 60, 80, 160, and 320. 
The results of these computations are given in Table 2. Observe that yj 


Table 2 


0.1 3.11727674 
0.05 3.12645293 


0.025 3.12885845 
0.0125 3.12947408 
0.00625 3.12962979 
0.003125 3.12966689 


and y39 agree in their first four decimal places. Hence the approximation 
y(1)=3.12966689 is correct to four decimal places. 


(iii) The improved Euler method 
Pascal Program 


Program Improved (input, output); 


var 
T, Y: array[0..999] of real; 
a, h, R1, R2: real; 
k, N: integer; 


begin 
readin(T [0], Y [0], a, N); 
h:=a/N; 
page, 
for k:=0 to N—1 do 
begin 
T(k+ 1] := T[k] +h; 
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R1:= Y [k] « (1+ exp(— Y[k])) + exp(T[k]); 
R2:=(Y[k] +h*R1) «(1+ exp(—(Y[k] +h*R1))) +exp(T[k+ 1]); 
Y(k+ 1] := Y[k] + (h/2) * (R1+ R2); 
end; 
writeln(‘N’:4, ‘h’:10, ‘Y[N]’:20); 
writeln(N:4,* ':2,h:10:7,‘ °:2, Y[N]:18:10); 
end. 


Fortran Program 


Replace Section B of the first Fortran program in this section by 


D020 K=1,N 
T(K+1)=T(K)+H 
R1 =Y(K) «(1 + EXP(— Y(K))) + EXP(T(K)) 
R2=(Y(K)+H«*R1)*(1 +EXP(—(Y(K)+H*R1)))+EXP(T(K + 1)) 
Y(K +1)=Y(K)+(H/2)*(R1+R2) 

20| | CONTINUE 


See also C Program 10 in Appendix C for a sample C program. 
We set A= 1, T[0]=O and Y[0]=0(T(1)=0 and Y(1)=0 in the Fortran pro- 
gram) and ran these programs for N= 10, 20, 40, 80, 160, and 320. The re- 
sults of these computations are given in Table 3. Observe that y,¢9 and 525 


Table 3 


h YN 
0.1 3.11450908 
0.05 3.12560685 


0.025 3.1286243 

0.0125 3.12941247 
0.00625 3.12961399 
0.003125 3.12964943 


agree in their first four decimal places. Hence the approximation y(1)= 
3.12964943 is correct to four decimal places. 


(iv) The method of Runge-Kutta 
Pascal Program 


Program Runge—Kutta (input, output); 


var 
T, Y: array[0..999] of real; 
a, h, LK1, LK2, LK3, LK4: real; 
k, N: integer; 
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begin 
readin(T[0], Y[0], a, N); 
h:=a/N; 
page; 
for k:=0 to N—1 do 
begin 
T(k+ 1] :=T[k] +h; 
LK1 :=(Y[k] « (1 + exp(— Y[k])) + exp(T[k]); 
LK2 :=(Y[k] + (h/2) * LK1) « (1+ exp(—(Y[k] + (h/2) * LK1))) 


+ exp(T[k] + (h/2)); 

LK3 :=(Y[k] + (h/2) « LK2) + (1 + exp(—(Y[k] + (h/2) * LK2))) 
+exp(T[k] + (h/2)); 

LK4 :=(Y[k] +h * LK3) « (1+ exp(—(Y[k] +h *LK3))) 
+exp(T[(k+ 1]); 

Y[k +1] :=Y[k] + (h/6) « (LK1+2*LK2+2*LK3+LK4); 


end; 
writeln(‘N’:4, ‘h’:10, ‘Y[N]’:20); 
writeln(N:4, ‘ ’:2,:10:7,° °:2, Y[IN]:18:10); 
001 10 


Fortran Program 


Replace Section B of the first Fortran program in this section by 


DO20 K=1,N 
T(K+1)=T(K)+H 
LK1 = Y(K)*(1 +EXP(— Y(K)) + EXP(T(K)) 
LK2=(Y(K)+(H/2)*LK1)*(1 +EXP(—(Y(K)+(H/2)*LK1))) 
1 | +EXP(T(K)+(H/2)) 
LK3=(Y(K)+(H/2) *LK2) «(1 +EXP(— (¥(K) +(H/2)*LK2))) 
1 | +EXP(T(K)+(H/2)) 
LK4=(Y(K) +H*LK3) *(1+EXP(—(Y(K)+H*LK3))) 
1 | +EXP(T(K +1)) 
20 Y(K + 1)=Y(K)+(H/6)#(LK1+2*LK2+2*LK3+LK4) 
CONTINUE 


See also C Program 11 in Appendix C for a sample C program. 
We set A=1, T[0] =O and Y[0]=0(7T(1)=0 and Y(1)=01n the Fortran pro- 
gram) and ran these programs for N = 10, 20, 40, 80, 160, and 320. The re- 
sults of these computations are given in Table 4. Notice that our approxi- 
mation of y(1) 1s already correct to four decimal places with h=0.1, and 
that it is already correct to eight decimal places with h =0.00625(N = 160). 
This example again illustrates the power of the Runge-Kutta method. 

We conclude this section with two examples which point out some addi- 
tional difficulties which may arise when we solve initial-value problems on 
a digital computer. 
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Table 4 


0.1 3.1296517 


0.05 3.12967998 

40 0.025 3.1296819 
80 0.0125 3.12968203 
160 0.00625 3.12968204 


0.003125 3.12968204 


Example 2. Use the Runge-Kutta method to find approximate values of 
the solution of the initial-value problem 
dy 
dt 
at the points ,=k/N, k=1,...,N. 
Solution. 


=t+y, y(0)=1 


Pascal Program 


Program Runge_kKutta (input, output); 


var 
T, Y: array[0..999] of real; 
a, h, LK1, LK2, LK3, LK4: real; 
k, N: integer; 


begin 
readin(T[0], Y[O], a, N); 
h:=a/N; 
page; 
for k:=0 to N—1 do 
begin 
T[k+ 1] :=T[k] +h; 
LK1 :=T[k] * T[k] + Y[k] * Y[k]; 
LK2 :=(T[k] + h/2) « (T[k] + h/2) 
+ (Y¥[k] + (h/2) * LK1) « (Y[k] + (h/2) * LK1); 
LK3 :=(T[k] + h/2) « (T[k] +h/2) 
+ (Y [k] + (h/2) * LK2) « (Y [k] + (h/2) * LK2); 
LK4 :=(T[k] +h) * (T[k] +h) 
+ (Y¥[k] +h * LK3) « (Y[k] +h « LK3); 
Y[(k+ 1) :=Y[k] + (h/6) « (LK1+2+*LK24+2+LK34LK4); 
end; 
writeln(‘T’:4, ‘Y’:15); 
for k:=0 to N do 
writeln(T [k]:10:7, ‘ °:2, Y[k]:16:9); 
end. 
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Fortran Program 


Replace Sections B and C of the first Fortran program in this section by 
DO20 K=1,N 
T(K+1)=T(K)+H 
LK1=T(K)* *2+Y(K)* *2 


Section B LK2=(T(K)+(H/2))* *2+(¥(K)+(H/2)*LK1)« #2 
Do computa- LK3=(T(K)+(H/2))* *2+(Y(K)+(H/2)*LK2)* #2 
tions LK4 =(T(K) +H) « #2+(Y(K)+H*LK3)« #2 


Y(K+1)=Y(K)+(H/6)*(LK1+2*LK2+2*LK3+LK4) 
20 CONTINUE 


NA=N+1 
WRITE (6, 30) (T(J), Y(J), J = 1, NA) 
SectionC  |30 FORMAT (1H1, 3X, 1HT, 4X, 1HY /(1H, 1X, F9.7, 


Print out 1 | 2X,F20.9/)) 
results CALL EXIT 
END 


See also C Program 12 in Appendix C for a sample C program. 
We attempted to run these programs with A= 1, T[0]=0, Y[0]=1(T(1) = 0, 
and Y(1)=1 in the Fortran program) and N= 10, but we received an error 
message that the numbers being computed exceeded the domain of the 
computer. That is to say, they were larger than 10°°. This indicates that the 
solution y(t) goes to infinity somewhere in the interval [0, 1]. We can prove 
this analytically, and even obtain an estimate of where y(t) goes to infinity, 
by the following clever argument. Observe that for O0<¢< 1, y(Z) is never 
less than the solution ¢,(t)=1/(1 — 2) of the initial-value problem 
d 
~ =y*, — y(0)=1. 


In addition, y(t) never exceeds the solution ¢$,(¢)=tan(t+ 7/4) of the ini- 
tial-value problem dy /dt=1+y7, y(0)=1. Hence, for 0<¢<1, 


fai < y(t) < tan(t+7/4). 


This situation is described graphically in Figure 1. Since $,(¢) and ¢,(t) be- 
come infinite at t=1 and t=7/4 respectively, we conclude that y(t) be- 
comes infinite somewhere between 7/4 and 1. 

The solutions of most initial-value problems which arise in physical and 
biological applications exist for all future time. Thus, we need not be 
overly concerned with the problem of solutions going to infinity in finite 
time or the problem of solutions becoming exceedingly large. On the other 
hand, though, there are several instances in economics where this problem 
is of paramount importance. In these instances, we are often interested in 
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y 
P(t )=tan(t+Z) y(t) 
NN ” 
P(t)= — 
t=2 t=! 
Figure 1 


determining whether certain differential equations can accurately model a 
given economic phenomenon. It is often possible to eliminate several of 
these equations by showing that they allow solutions which are unrealisti- 
cally large. 


Example 3. Use Euler’s method to determine approximate values of the 
solution of the initial-value problem 
d 


 =yly|-3/44- ¢sin@ = 
Hilo + esin], —-y(0) =0 (1) 


at the points 1/N,2/N,...,2. 
Solution. The programming for this problem is simplified immensely by 
observing that 


J, y>do 
y|y|~3/4= (sgn y)|y|'/4, where sgny=; 0, y=0 


Pascal Program 


Program Euler (input, output); 


const 
Pl =3.141592654; 


var 
T, Y: array[0..999] of real; 
h: real; 
k, N: integer; 
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begin 


1.17 What to do in practice 


readin(N); 


page, 


‘= 2/N; 
T[1] :=h; 
Y [1] :=0; 
fork:=1to N—1do 


begin 
T{k+ 1] :=T[k] +h; 
if Y[k] =O then Y[k+ 1]:=h~* T[k] «sin(PI/T[k]) else 
Y(k+ 1] := Y[k] +h « (Y[k] « exp ((— 3/4) « In(abs(Y [k]))) 


+ T[k] « sin(PI/T [k])); 


writeln(‘T’:4, ‘Y’:15); 
for k :=1toNdo 


end. 


10 


writelIn(T[k]:10:7, ‘ ’:2, Y[k]:16:9); 


Fortran Program 


DIMENSION T(1000), Y(1000) 

READ (5,10) N 

FORMAT (15) 

H=2/N 

T(1)=H 

¥(1)=0 

DO20 K=2,N 

T(K)=T(K—1)+H 

Y(K) =Y(K—1) +H « (SIGN(Y(K—1)) « ABS(Y(K —1)) «+ 0.25 
+T(K—1) *SIN(3.141592654/T(K — 1))) 

CONTINUE 

WRITE(6, 30) 0, 0, (T(J), Y(J), J=1,N) 

FORMAT (1H1, 3X, 1HT, 4X, 1HY /(1H, 1X, F10.7, 2X, F20.9/)) 

CALL EXIT 

END 


See also C Program 13 in Appendix C for a sample C program. 
When we set N =25 we obtained the value 2.4844172 for y(2), but when we 
set N =27, we obtained the value —0.50244575 for y(2). Moreover, all the 
y, Were positive for N=25 and negative for N=27. We repeated these 
computations with N=89 and N=91 and obtained the values 2.64286349 
and — 0.6318074 respectively. In addition, all the y, were again positive for 
N=89 and negative for N=91. Indeed, it is possible, but rather difficult, 
to prove that all the y, will be positive if N=1,5,9, 13,17,... and negative 
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if N =3,7,11,15,.... This suggests that the solution of the initial-value 
problem (1) is not unique. We cannot prove this analytically, since we 
cannot solve the differential equation explicitly. It should be noted though, 
that the existence—uniqueness theorem of Section 1.10 does not apply here, 
since the partial derivative with respect to y of the function |y|~?/4y+ 
tsina /t does not exist at y=0. 


Most of the initial-value problems that arise in applications have unique 
solutions. Thus, we need not be overly concerned with the problem of non- 
uniqueness of solutions. However, we should always bear in mind that ini- 
tial-value problems which do not obey the hypotheses of the existence— 
uniqueness theorem of Section 1.10 might possess more than one solution, 
for the consequences of picking the wrong solution in these rare instances 
can often be catastrophic. 


EXERCISES 


In each of Problems 1-5, find the solution of the given initial-value prob- 
lem at t=1 to four decimal places accuracy. 


dy _ -y _ yi, 2 _ 
1. wt te +21, y(0)=0 2. a! t+y*, y(0)=0 
dy _ tty? _ dy _ t,,2 = 
3. dt 14rty?’ y(0)=0 4. wm °” 2y, y(0)=1 
dy 


—_— 3 => 
5. =y3-y, yO=l 
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differential equations 


2.1 Algebraic properties of solutions 


A second-order differential equation is an equation of the form 


£m s(eo,®). (1) 


For example, the equation 


d*y dy \? 

We =sint+3y+(F] 
is a second-order differential equation. A function y= y(f) is a solution of 
(1) if »(¢) satisfies the differential equation; that is 


dy(t)_ i 510) 


dt? ty(t), dt ° 


Thus, the function y(¢)=cost is a solution of the second-order equation 
d*y /dt?=—y since d*(cost)/dt? = —cost. 

Second-order differential equations arise quite often in applications. 
The most famous second-order differential equation is Newton’s second 
law of motion (see Section 1.7) 


d*y dy 
mor = FW] 


which governs the motion of a particle of mass m moving under the in- 
fluence of a force F. In this equation, m is the mass of the particle, y = y(f) 
is its position at time ¢, dy /dt is its velocity, and F is the total force acting 
on the particle. As the notation suggests, the force F may depend on the 
position and velocity of the particle, as well as on time. 
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In addition to the differential equation (1), we will often impose initial 
conditions on y(t) of the form 


V(%)=Yo —-¥"(to) = Yo (1’) 
The differential equation (1) together with the initial conditions (1’) is re- 
ferred to as an initial-value problem. For example, let y(t)* denote the 
position at time ¢ of a particle moving under the influence of gravity. Then, 
y(Z) satisfies the initial-value problem 


y ' ’ 
—> ~ =—8; V(to)=Yo Y(t) =Yo» 


where yp is the initial position of the particle and yo is the initial velocity of 
the particle. 

Second-order differential equations are extremely difficult to solve. This 
should not come as a great surprise to us after our experience with first- 
order equations. We will only succeed in solving the special differential 
equation 

d d*y 

a2 +p) 7 ta(t)y =s(7). (2) 
Fortunately, though, many of the second-order equations that arise in ap- 
plications are of this form. 

The differential equation (2) is called a second-order linear differential 
equation. We single out this equation and call it linear because both y and 
dy /dt appear by themselves. For example, the differential equations 

d*y _,¥ 
ae +3t7- +(sint) y =e! 
and 
d*y _ Wh 
de +e dt +2y= =] 


are linear, while the differential equations 
d*y ad. 3 
2 +3 a +siny=t 


d* 2 
dt? dt 
are both nonlinear, due to the presence of the siny and (dy/dt)* terms, re- 


spectively. 
We consider first the a linear homogeneous equation 


ay +p) 7 + a(t)y =0 (3) 


*The positive direction of y is taken upwards. 


and 
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which is obtained from (2) by setting g(t)=0. It is certainly not obvious at 
this point how to find all the solutions of (3), or how to solve the initial- 
value problem 

d*y dy 

— tPF + a(t y=0; v(t) =Yo ¥'(to) = Vo (4) 

dt f 
Therefore, before trying to develop any elaborate procedures for solving 
(4), we should first determine whether it actually has a solution. This infor- 
mation is contained in the following theorem, whose proof will be indi- 
cated in Chapter 4. 


Theorem 1. (Existence—uniqueness Theorem). Let the functions p(t) and 
q(t) be continuous in the open interval a<t< B. Then, there exists one, 
and only one function y(t) satisfying the differential equation (3) on the en- 
tire interval a<.t< B, and the prescribed initial conditions y (to) = Yo; Y' (to) 
= yo. In particular, any solution y = y(t) of (3) which satisfies y (to) =0 and 
y'(to) =90 at some time t=t, must be identically zero. 


Theorem 1 is an extremely important theorem for us. On the one hand, 
it is our hunting license to find the unique solution y(¢) of (4). And, on the 
other hand, we will actually use Theorem | to help us find all the solutions 
of (3). 

We begin our analysis of Equation (3) with the important observation 
that the left-hand side 


dt? 


of the differential equation can be viewed as defining a “function of a 
function”: with each function y having two derivatives, we associate 
another function, which we'll call L[y], by the relation 


Ll yJO=y"(Ote Oy (+ aor). 


In mathematical terminology, L is an operator which operates on func- 
tions; that is, there is a prescribed recipe for associating with each function 
y a new function L[ y]. 


” _& ,_ ay 
reor+aoy (v=. 


Example 1. Let p(t)=0 and g(t)=1t. Then, 
Ll y ](t)=y"(t) + (2). 
If y(t)=cost?, then 
L| y |(t) =(cost)” + tcost=(t—1)cost, 
and if y(¢)= 72°, then 
LI y](Q=(0)" + t(08) = 14 + 61. 
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Thus, the operator L assigns the function (t— 1)cost¢ to the function cosf, 
and the function 6¢+ ¢* to the function ?¢°. 

The concept of an operator acting on functions, or a “function of a 
function” is analogous to that of a function of a single variable ¢. Recall 
the definition of a function f on an interval J: with each number ¢ in J we 
associate a new number called f(r). In an exactly analogous manner, we 
associate with each function y having two derivatives a new function called 
L{y]. This is an extremely sophisticated mathematical concept, because in 
a certain sense, we are treating a function exactly as we do a point. 
Admittedly, this is quite difficult to grasp. It’s not surprising, therefore, 
that the concept of a “function of a function” was not developed till the 
beginning of this century, and that many of the “high powered” theorems 
of mathematical analysis were proved only after this concept was 
mastered. 

We now derive several important properties of the operator L, which we 
will use to great advantage shortly. 


Property 1. L[cy]=cL[y], for any constant c. 
Proor. L{cy|(t)=(cv)"() + p(lov)'() + ao (4) 
=cy"(t)+ cp(t)y(t) + cq(t) y(2) 
=cl y"(t)t+p(t)y'(t)+ a(t) y(Z) | 
=cL| y](t). CL 


The meaning of Property | is that the operator L assigns to the function 
(cy) c times the function it assigns to y. For example, let 


LI y ](t)=y"(t) + 6y'(t) - 2y(2). 
This operator L assigns the function 
(17)” +6(07)’ —2(t7)=24+ 121-277 
to the function ¢?. Hence, L must assign the function 5(2+ 121-22) to the 
function 577. 
Property 2. L[y,+y.]=L[y,]+ Liy]. 
PROOF. 
LI vit yz) = (yy t92)"(O) + POO +92) + (Ot) 
=yi ty Ot POnOtPOnO+aOvOt+ a) y2() 
=[y/'OteOnO+ay(O]+lyo7O+e(Oxx(9O+ 4a) y2(9 | 
=Ll y,]()+ Ly | (2). 0 


The meaning of Property 2 is that the operator L assigns to the function 
y,+ y, the sum of the functions it assigns to y, and y,. For example, let 


LI y |\(t)=y"(t) +2y'(t) —y (2). 
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This operator L assigns the function 
(cost)” +2(cost)’—cost= —2cost—2sint 
to the function cost, and the function 
(sint)” + 2(sint)’—sint=2cost—2sint 
to the function sin¢. Hence, L assigns the function 
(—2cost—2sint)+2cost—2sint=—4sin¢ 


to the function sint+cost. 


Definition. An operator L which assigns functions to functions and which 
satisfies Properties 1 and 2 is called a linear operator. All other opera- 
tors are nonlinear. An example of a nonlinear operator is 


LL y]()=y"()-2¢eL yO’. 


This operator assigns the function 


1 \” 1) 2 2_ 
(7) u( 7) =4 n° 
to the function 1/t, and the function 
Cc Mt 
() Bp Bp P 


to the function c/t. Hence, for c#0,1, and y(t)=1/t, we see that L[cy] 
#cL{[y}. 


2*(£) = 2c ct 2e(1-c’) 


The usefulness of Properties 1 and 2 lies in the observation that the 
solutions y(t) of the differential equation (3) are exactly those functions y 
for which 


Li y|[=yv"(O+p(Oy'(t) +42) y(t) =0. 


In other words, the solutions y(t) of (3) are exactly those functions y to 
which the operator L assigns the zero function.* Hence, if y(t) is a solution 
of (3) then so is cy(t), since 


L| cy |(t)=cL[ y ](t) =0. 
If y,(t) and y,(¢) are solutions of (3), then y,(t)+,y,(Z) is also a solution of 
(3), since 


Ll yy, ty. |(O=L[ 1 O+ L[ v2 ]() =04+0=0. 
Combining Properties 1 and 2, we see that all linear combinations 


C1 Y(t) +c yo (2) 
of solutions of (3) are again solutions of (3). 


*The zero function is the function whose value at any time ¢ is zero. 
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The preceding argument shows that we can use our knowledge of two 
solutions y,(t) and y,(t) of (3) to generate infinitely many other solutions. 
This statement has some very interesting implications. Consider, for exam- 
ple, the differential equation 


—+y=0. (5) 


Two solutions of (5) are y,(¢)=cost and y,(¢)=sin¢. Hence, 
y(t)=c,cost+c,sint (6) 
is also a solution of (5), for every choice of constants c, and c,. Now, 
Equation (6) contains two arbitrary constants. It is natural to suspect, 
therefore, that this expression represents the general solution of (5); that is, 
every solution y(t) of (5) must be of the form (6). This is indeed the case, 
as we now show. Let y(t) be any solution of (5). By the existence—unique- 


ness theorem, y(¢) exists for all ¢. Let y(0)= yo, y’(0)=yo, and consider the 
function 


o(t)=y,cost+yosint. 
This function is a solution of (5) since it is a linear combination of solu- 
tions of (5). Moreover, $(0)=y, and ¢’(0)=yo. Thus, y(t) and 9$(?) satisfy 
the same second-order linear homogeneous equation and the same initial 
conditions. Therefore, by the uniqueness part of Theorem 1, y(¢) must be 
identically equal to ¢(t), so that 


y(t)=yocost+yosint. 
Thus, Equation (6) is indeed the general solution of (5). 
Let us return now to the general linear equation (3). Suppose, in some 


manner, that we manage to find two solutions y,(¢) and y,(¢) of (3). Then, 
every function 


V(t)= C1 V1 (t) + ayo (t) (7) 
is again a solution of (3). Does the expression (7) represent the general 


solution of (3)? That is to say, does every solution y(t) of (3) have the form 
(7)? The following theorem answers this question. 


Theorem 2. Let y,(t) and y,(t) be two solutions of (3) on the intervala<t< 
B, with 
Yi(t)y2(t) Wi (4) y2(4) 
unequal to zero in this interval. Then, 
Y(t=ey y(t) + cy y2(t) 
is the general solution of (3). 


ProoF. Let y(t) be any solution of (3). We must find constants c, and c, 
such that y(‘)=c, y,(t)+ c,y,(¢). To this end, pick a time f, in the interval 
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(a, 6) and let yp and yg denote the values of y and y’ at ¢=¢). The constants 
c, and cp, if they exist, must satisfy the two equations 


C1 V1 (to) + 2.2 (to) = Yo 
C1 Y4 (to) + C2 ¥2(to) =o. 
Multiplying the first equation by y3(f9), the second equation by y,(ty) and 
subtracting gives 
¢1[ ¥1 (to) ¥2(to) — 1 (to) ¥2 (to) | = 0.2 (t0) — ¥O¥2 (to): 
Similarly, multiplying the first equation by yj(t,), the second equation by 
y,(to) and subtracting gives 


col yi (to) ¥2(to) 1 (19) ¥3(to) | = Yo (lo) — Yo 1 (40). 


Hence, 
o= YoV2(to) —YoV2(to) 
Y¥1(t)¥2(40) — V1 (lo) ¥2 (40) 
and 
_— Yo (40) — Voi (40) 
2 


¥1 (40) ¥2(t0) ~¥1 (40) ¥2 (40) 

if ¥1(to) ¥2(Lo) — V(t) ¥2(%o) #0. Now, let 
O(t)=c, y(t) + ey y2(0) 
for this choice of constants c,,c,. We know that ¢$(¢) satisfies (3), since it is 
a linear combination of solutions of (3). Moreover, by construction, $(t)) = 
Yo and $'(49)=yo. Thus, y(t) and $(¢) satisfy the same second-order linear 
homogeneous equation and the same initial conditions. Therefore, by the 
uniqueness part of Theorem 1, y(t) must be identically equal to o(¢); that 
1S, 
y(th=cy y(t) +c. y2(2), a<t<Bp. CJ 

Theorem 2 is an extremely useful theorem since it reduces the problem 
of finding all solutions of (3), of which there are infinitely many, to the 
much simpler problem of finding just two solutions y,(t),y,(t). The only 
condition imposed on the solutions y,(t) and y,(t) is that the quantity 
y(t) y(t) —y() y2(4) be unequal to zero for a<t< B. When this is the 
case, we say that y,(¢) and y,(¢) are a fundamental set of solutions of (3), 
since all other solutions of (3) can be obtained by taking linear combina- 
tions of y,(¢) and y,(?). 


Definition. The quantity y,(t)y3(4)— y(t) y,(0) is called the Wronskian of 
y, and y,, and is denoted by W(t)= W[y,,y](0). 


Theorem 2 requires that W[y,,y2](t) be unequal to zero at all points in 
the interval (a, 8). In actual fact, the Wronskian of any two solutions 
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y(2),y(0) of (3) is either identically zero, or is never zero, as we now 
show. 


Theorem 3. Let p(t) and q(t) be continuous in the interval a<t< B, and let 
y,(t) and y,(t) be two solutions of (3). Then, W[y,,¥>\(t) is either identi- 
cally zero, or is never zero, on the interval a<t< B. 


We prove Theorem 3 with the aid of the following lemma. 


Lemma 1. Let y,(t) and y,(t) be two solutions of the linear differential equa- 
tion y” + p(t)y’ + q(t)y =0. Then, their Wronskian 


W (t= WY y2](O=i (OY) 1 (Y2(2) 
satisfies the first-order differential equation 
W'+ p(t)W=0. 
ProoF. Observe that 
/ d , , 
W'(t)= 7 (12 - Vi Y2) 
=Yi V2 tViV2-Vi 2-2 
=Yi V2 —Vi dr 
Since y, and y, are solutions of y” + p(t) y’ + q(t) y =0, we know that 


x= —P(t)y2.—- (4) ¥2 
and 
Yi=—P(t)yi-a(t)y. 
Hence, 
W'(t)=y,| —P()2-4() 2] -y2[ —P(O 1-4), | 
=—p(t)[ viy2-Y1)2] 
= —p(t)W(t). 0 


We can now give a very simple proof of Theorem 3. 


PROOF OF THEOREM 3. Pick any ¢, in the interval (a,8). From Lemma 1, 
t 
W[ yipv2 (= W192 |(to)exp -f p(s)ds |}. 
fo 


t 
Now, exp( — f p(s) ds] is unequal to zero for a<t< fB. Therefore, 


WT[y,,¥2](2) 1s either identically zero, or is never zero. O 
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The simplest situation where the Wronskian of two functions y,(¢),2(¢) 
vanishes identically is when one of the functions is identically zero. More 
generally, the Wronskian of two functions y,(t),y,(¢) vanishes identically if 
one of the functions is a constant multiple of the other. If y,= cy), Say, 
then 


WI yo |(1)=¥1(01)' — 1 (01) =. 
Conversely, suppose that the Wronskian of two solutions y,(t),y(t) of (3) 
vanishes identically. Then, one of these solutions must be a constant multi- 
ple of the other, as we now show. 


Theorem 4, Let y,(t) and y,(t) be two solutions of (3) on the interval a<t< 
B, and suppose that W[y,,¥|(to) =0 for some ty in this interval. Then, one 
of these solutions is a constant multiple of the other. 


PrRooF #1. Suppose that W[y,,y,](t.)=0. Then, the equations 


C1 V1 (to) + C2. ¥a(to) =0 
C11 (to) + C2.¥2 (to) = 9 


have a nontrivial solution c,,c,; that is, a solution c,,c, with |c,|+|c,| 40. 
Let y(t)=c, y(t) +c, y,(0), for this choice of constants c,,c,. We know that 
y(t) is a solution of (3), since it is a linear combination of y,(¢) and y,(7). 
Moreover, by construction, y(¢))=0 and y’(t))=0. Therefore, by Theorem 
1, y(2) is identically zero, so that 


CY (+e, y,()=0, ax<t<B, 


If c, #0, then y,(t)= —(c,/c,)y,(t), and if c, #0, then y,(1)= 
—(c,/Cz)¥,(t). In either case, one of these solutions is a constant multiple 
of the other. O 


PROOF #2. Suppose that W[y,,y,](t9.)=0. Then, by Theorem 3, 
W[y1,¥2](2) is identically zero. Assume that y,(t)y,(t)#0 for a<t< B. 
Then, dividing both sides of the equation 


yi (t)y2(t) -—y1 (4) ¥2(4) =0 

by y,(t)y2(t) gives 

y2(t) vit) _ 

y(t) y(t) © 
This equation implies that y,(¢)= cy,(t) for some constant c. 

Next, suppose that y,(¢) y,(¢) is zero at some point ¢= ¢* in the interval 

a<t<£. Without loss of generality, we may assume that y,(t*)=0, since 
otherwise we can relabel y, and y,. In this case it is simple to show (see Ex- 


ercise 19) that either y,(¢)=0, or y,(t)=[y3(t*)/y\(¢*)]y,(). This com- 
pletes the proof of Theorem 4. O 


135 


2 Second-order linear differential equations 


Definition. The functions y,(t) and y,(¢) are said to be linearly dependent 
on an interval J if one of these functions is a constant multiple of the 
Other on J. The functions y,(t) and y,(t) are said to be linearly indepen- 
dent on an interval J if they are not linearly dependent on J. 


Corollary to Theorem 4. Two solutions y(t) and y,(t) of (3) are linearly in- 
dependent on the interval a<t< B if, and only if, their Wronskian is un- 
equal to zero on this interval. Thus, two solutions y(t) and y(t) form a 
fundamental set of solutions of (3) on the interval a<t< B if, and only if, 
they are linearly independent on this interval. 


EXERCISES 
1, Let LL y\(O=y"(t)—30'(t) + 3y(2t). Compute 
(a) Lie‘), (b) L[cos V3 ¢], (c) L[2e‘+4cos V3 f¢], 
(d) Le], (e) L[5z7, (f) L[¢], (g) L{t? +37]. 
2. Let LL y}()=y"(t)—6y’(t) + 5y(t). Compute 
(a) Lle’], (b) L{e~*], (c) L{e*‘], (d) Lie”), 
(e) L[¢], (f) L[t?], (g) L[t? +22]. 
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. Show that the operator L defined by 


Ll yl(t)= [2y) ds 


is linear; that is, L[cy]=cL[y] and L[y, +y2]= L[y,] + Ll yp]. 


. Let Li y\(O=y’(O+pOy (+ q()y(H), and suppose that L[r?]=¢+1 and 


L{t]}=2t+2. Show that y(t)=¢— 22? is a solution of y” + p(t)y’+ q(t) y =0. 


. (a) Show that y,()= Vz? and y,(t)=1/t are solutions of the differential equa- 


tion 
2t*y” +3t’—y=0 (*) 


on the interval 0<t< oo. 

(b) Compute W[y,,y,](t). What happens as ¢ approaches zero? 

(c) Show that y,(¢) and y,(¢) form a fundamental set of solutions of (*) on the 
interval 0< t< oo. 

(d) Solve the initial-value problem 27y” +3ty’—y =0; y(1)=2, y'(D=1. 


. (a) Show that y,())=e7"/? and y.()=e7?/ fier’ 2ds are solutions of 


y’+pn’+y=0 (*) 


on the interval —co<t<oo. 

(b) Compute W[y,,y2](2). 

(c) Show that y, and y, form a fundamental set of solutions of (*) on the inter- 
val —-o<t< oo. 

(d) Solve the initial-value problem y” + ty’+y =0; y(0)=0, y’(0)=1. 


10. 


11. 


12. 


13. 


14, 


2.1 Algebraic properties of solutions 


. Compute the Wronskian of the following pairs of functions. 


(a) sinat,cos bt (b) sin??, 1—cos2¢ 
(c) et et (d) e® tet 
(e) ¢,¢ln¢ (f) e” sin bt,e* cos bt 


. Let y,(¢) and y,(t) be solutions of (3) on the interval — 00 <t< oo with y,(0)= 


3, y(0) = 1, y,(0) = 1, and y3(0) = +. Show that y,(¢) and y,(‘) are linearly depen- 
dent on the interval — 0 <t<oo. 


. (a) Let y,(t) and y,(t) be solutions of (3) on the interval a<t< B, with y,(%)= 


1, yi(to) = 9, yo(to) =0, and y3(to)= 1. Show that y,(¢) and y,(t) form a 
fundamental set of solutions of (3) on the interval a<t< B. 

(b) Show that y()=yoy,(t)+y9 y(t) is the solution of (3) satisfying y (to) = Yo 
and y’(to) = Yo. 


Show that y(‘)=?* can never be a solution of (3) if the functions p(t) and q(t) 
are continuous at t=0. 


Let y,(Q=2 and y,(t)=¢l¢\. 

(a) Show that y, and y, are linearly dependent on the interval 0<¢< 1. 

(b) Show that y, and y, are linearly independent on the interval —1 <?¢<l. 

(c) Show that W[y,,y](t) is identically zero. 

(d) Show that y, and y, can never be two solutions of (3) on the interval —1l<t 
<1 if both p and q are continuous in this interval. 


Suppose that y, and y, are linearly independent on an interval J. Prove that z, 
=y,+y, and z,=y,—y, are also linearly independent on /. 


Let y, and y, be solutions of Bessel’s equation 
t?y” + ty’ + (127-1?) y=0 


on the interval 0< t< oo, with y,(1)=1, y}(1)=0, y.(1)=0, and y3(1)=1. Com- 
pute W[y,,y2](¢). 


Suppose that the Wronskian of any two solutions of (3) is constant in time. 
Prove that p(t)=0. 


In Problems 15-18, assume that p and g are continuous, and that the func- 
tions y, and y, are solutions of the differential equation 


y"+p(t)y’+q(t)y =0 


on the interval a<¢t< B. 


15. 


16. 


17. 


Prove that if y,; and y vanish at the same point in the interval a<t< B, then 
they cannot form a fundamental set of solutions on this interval. 


Prove that if y, and y, achieve a maximum or minimum at the same point in 
the interval a<t< B, then they cannot form a fundamental set of solutions on 
this interval. 


Prove that if y, and y, are a fundamental set of solutions, then they cannot 
have a common point of inflection in a<t< B unless p and q vanish simulta- 
neously there. 
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18. Suppose that y, and y, are a fundamental set of solutions on the interval — oo 
<t<oo. Show that there is one and only one zero of y, between consecutive 
zeros Of y». Hint: Differentiate the quantity y./y, and use Rolle’s Theorem. 


19. Suppose that W[y,,y2](t*)=0, and, in addition, y,(t*)=0. Prove that either 
¥\(t)=0 or y(t) =[y2(t*)/yi(t*) yi). Hint: If W[y1,y2\(t*)=0 and y,(2*) =0, 
then y2(t*) yi(t*) =0. 


2.2 Linear equations with constant coefficients 


We consider now the homogeneous linear second-order equation with con- 
stant coefficients 


d 
L[y]=a75 + mn t1v79 (1) 


where a, b, and c are constants, with a#0. Theorem 2 of Section 2.1 tells 
us that we need only find two independent solutions y, and y, of (1); all 
other solutions of (1) are then obtained by taking linear combinations of y, 
and y,. Unfortunately, Theorem 2 doesn’t tell us how to find two solutions 
of (1). Therefore, we will try an educated guess. To this end, observe that a 
function y(f) is a solution of (1) if a constant times its second derivative, 
plus another constant times its first derivative, plus a third constant times 
itself is identically zero. In other words, the three terms ay”, by’, and cy 
must cancel each other. In general, this can only occur if the three func- 
tions y(t), y(t), and y”(¢) are of the “same type”. For example, the func- 
tion y(t)=2° can never be a solution of (1) since the three terms 20az?, 
5bt*, and ct? are polynomials in ¢ of different degree, and therefore cannot 
cancel each other. On the other hand, the function y(t)=e”, r constant, 
has the property that both y’(t) and y”(¢) are multiples of y(t). This sug- 
gests that we try y(#)=e” as a solution of (1). Computing 


L[ e”|=a(e")” + b(e”)' + c(e”) 
=(ar?+br+c)e", 
we see that y(t)=e” is a solution of (1) if, and only if 
ar? + br+c=0. (2) 


Equation (2) is called the characteristic equation of (1). It has two roots 
r),r, given by the quadratic formula 
_ b+ Vb —4ac = b- Vb? 4ae 
2a 2 2a 
If b*—4ac is positive, then r, and r, are real and distinct. In this case, y,(¢) 
=e" and y,(f)=e’ are two distinct solutions of (1). These solutions are 
clearly linearly independent (on any interval J), since e’” is obviously not 
a constant multiple of e’” for r,#r,. (If the reader is unconvinced of this 


138 


2.2 Linear equations with constant coefficients 


he can compute 
We” ,e7"] =(r)- rent, 
and observe that W is never zero. Hence, e’ and e”” are linearly indep2n- 


dent on any interval J.) 


Example 1. Find the general solution of the equation 


d’y .& 

—+5— +4y=0. 3 

de +5 ds (3) 
Solution. The characteristic equation r?+5r+4=(r+4)(r+1)=0 has two 
distinct roots r, = —4 and r,= — 1. Thus, y,(t)=e “ and y,(¢)=e‘ form a 


fundamental set of solutions of (3), and every solution y(t) of (3) is of the 
form 


y(th=cye"*+c,e7' 


for some choice of constants c,,¢5. 


Example 2. Find the solution y(t) of the initial-value problem 


dy WP 
Wea 8% POR=I v'(O)=2. 


Solution. The characteristic equation r?+4r—2=0 has 2 roots 


=4+VI6H8 _ oy Ve 


l 
and 


_ -4-VI68 og vg 


lo = ) 


Hence, y,(¢)=e’" and y,(t)=e” 
y” +4y’—2y =0, so that 


t t 


are a fundamental set of solutions of 


y(t) =c,e(-2+ V6 eg c,e(-2- V6) 


for some choice of constants c,,c,. The constants c, and c, are determined 
from the initial conditions 


From the first equation, c)=1—c,. Substituting this value of c, into the 
second equation gives 


(-24+ V6 je, -(2+ V6 )(1—c,)=2, or 2V6c,=44+ V6. 
Therefore, c,=2/V6 +3, c,=1-—c,;=4-2/V6, and 


~f{1,_2 \,(-2+v6y4(1_ 2 \,-+vé): 
y(t) (s+ = }e +(5 ai : 
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EXERCISES 


Find the general solution of each of the following equations. 


dy dy _dy 
—+-y=0 2. 6 a tY=9 


d*y 
3. —-—-3—+y=0 4, 372 og t= 9 


1. 


Solve each of the following initial-value problems. 


d*y dy 
5, sa 3 749 =O y(0)=1, y’(0)=0 


d*y wy 


d*y wd 
7. Sao +5 y=: y(0)=0, y’(0)=1 


2 
8. —— -6—+y=0; y(2)=1, y'(2)=1 


Remark. In doing Problems 6 and 8, observe that e’“~ is also a solution 
of the differential equation ay” + by’+cy =0 if ar?+ br+c=0. Thus, to 
find the solution y(f) of the initial-value problem ay” + by’+ cy =0; y(t) = 
Vor (to) =Vo, We would write y(1)=c,e"~ + c,e"'—) and solve for c, 
and c, from the initial conditions. 


9. Let y(t) be the solution of the initial-value problem 


d*y ad 
We dt 1 Y= % y(O=1, yYOEH=V. 


For what values of V does y(t) remain nonnegative for all ¢>0? 
10. The differential equation 
Li y]=t’y” + aty’+ By =0 (*) 


is known as Euler’s equation. Observe that ¢7y”, ty’, and y are all multiples of 
t’ if y=¢’. This suggests that we try y=?’ as a solution of (*). Show that y=’ 
is a solution of (*) if r?+(a—1)r+B=0. 


11. Find the general solution of the equation 
t?7y" +54’ —5y =0, t>0 
12. Solve the initial-value problem 
Py"—p’—-2y=0;  y(I)=0, y'(I)=1 
on the interval 0<t< oo. 
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2.2.1 Complex roots 


If b?—4ac is negative, then the characteristic equation ar?+ br+c=0 has 
complex roots 


= —~b+ivV 4ac— b? —~b-iV 4ac— b? 
2a 2a 


We would like to say that e”’ and e”™ are solutions of the differential 
equation 


and r,= 


d? 
as +62 +cy =0. (1) 
However, this presents us with two serious difficulties. On the one hand, 
the function e” is not defined, as yet, for r complex. And on the other 
hand, even if we succeed in defining e”” and e’” as complex-valued solu- 
tions of (1), we are still faced with the problem of finding two real-valued 
solutions of (1). 
We begin by resolving the second difficulty, since otherwise there’s no 
sense tackling the first problem. Assume that y(t)=u(¢)+ iv(¢) 1s a com- 
plex-valued solution of (1). This means, of course, that 


al u”(t)+ iv” (t) | + b| u'(t) + iv’(t) | + c| u(t)+ iv(t) | = 0). (2) 


This complex-valued solution of (1) gives rise to two real-valued solutions, 
as we now show. 


Lemma 1. Let y(t)=u(t)+ iv(t) be a complex-valued solution of (1), with a, 
b, and c real. Then, y,(t)= u(t) and y,(t)= v(t) are two real-valued solu- 
tions of (1). In other words, both the real and imaginary parts of a com- 
plex-valued solution of (1) are solutions of (1). (The imaginary part of the 
complex number a+ iB is 8. Similarly, the imaginary part of the function 
u(t) + iv(t) is v(t).) 


Proor. From Equation (2), 
[ u(t) + bu’(t) + cu(t) | + i[ av” (t) + bo’(t) + cv(t) |] =0. (3) 


Now, if a complex number is zero, then both its real and imaginary parts 
must be zero. Consequently, 


au"(t)+ bu'(t)+cu(t)=0 and av’(t)+ bv'(t)+cv(t)=9, 
and this proves Lemma I. 0 
The problem of defining e” for r complex can also be resolved quite 
easily. Let r=a+ if. By the law of exponents, 
el = ee iBr (4) 
Thus, we need only define the quantity e’*’, for B real. To this end, recall 


141 


2 Second-order linear differential equations 


that 
2 3 
ealtxt tate. (5) 
Equation (5) makes sense, formally, even for x complex. This suggests that 
we set 


D4\2 -Q4\3 
e'P! =| + iBt+ “ + “ + 


Next, observe that 


(pty BY p38 Bat ip r? 
1+ iptt+ 1 +...=1+iBt- Tay + + 51 + 
Bt? B4At4 . Bt? B°t? 
=|l- Sp tarot |t Bt- tap te 
=cos 6t+isin Bt. 
Hence, 
ele + iB)t = ele iBt = e%! (cos Br + isin Bt). (6) 


Returning to the differential equation (1), we see that 


y(t)= ol—b+iV4ac— B? ]t/2a 


= e~ /24| cos V 4ac— b? t/2a+t isin V 4ac — b? t/2a| 


is a complex-valued solution of (1) if b?—4ac is negative. Therefore, by 
Lemma lI, 


V 4ac — b? 
2a 


are two real-valued solutions of (1). These two functions are linearly inde- 
pendent on any interval J, since their Wronskian (see Exercise 10) is never 
zero. Consequently, the general solution of (1) for b*—4ac <0 is 


V 4ac — b? 


2a 


y, (t)=e7"/*4cosBt and y,(t)=e7"/4sinBt, B= 


y(t)=e "40 c.cosBtt+c,sinBt], B= 


Remark 1. Strictly speaking, we must verify that the formula 


d 
—el — re™ 


dt 


is true even for r complex, before we can assert that e”’ and e” are com- 
plex-valued solutions of (1). To this end, we compute 


da (a+ ipyt_ 4 at - os 
ae ae [ cos Br + isin Bt | 
= e*"| (acos Bt— Bsin Bt) + i(asin Bt + Bcos Bt) | 
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and this equals (a+ iB)e@*'”, since 
(a+ iB )e** = (a+ iB je“ | cos Bt + isin Br] 
= el (acosBt— Bsin Bt) + i(asin Bt + Bcos Br) |. 


Thus, (d/dt)e"=re", even for r complex. 


Remark 2. At first glance, one might think that e’” would give rise to two 
additional solutions of (1). This is not the case, though, since 


eft! = e— (b/2a)tg — iBt. B= V 4ac — b2 /2a 
= e~"1/24) cos( — Br) + isin(— Bt) |= e~"/?4[ cos Bt — isin Br]. 
Hence, 
Re{e™ } =e~"/**cos Bt=y,(t) 
and 
Im{e" } = —e7"/4sin Bt = — y(t). 


Example 1. Find two linearly independent real-valued solutions of the dif- 
ferential equation 


d’y _, 
4-44 = 
” +4 toy =9. (7) 
Solution. The characteristic equation 4r?+4r+5=0 has complex roots r, 
=—3+i and r,=—4-—i. Consequently, 


eM = ef“ I/24 01 = oe '/2 cost + ie '/* sint 
is a complex-valued solution of (7). Therefore, by Lemma 1, 
Re{e™}=e- ‘cost and Im{e™}=e7'/*sinz 


are two linearly independent real-valued solutions of (7). 


Example 2. Find the solution y(t) of the initial-value problem 


d’y 

We 1 dt +4y =0; y(0)=1, y (0)=1. 
Solution. The characteristic equation r*+2r+4=0 has complex roots r,= 
—1+V3 iandr,=—1-—V3 i. Hence, 

ell = e(“1+V3 t= o- legs V3 t+ ie~'sin V3 t 
is a complex-valued solution of y” +2y’+4y=0. Therefore, by Lemma 1, 
both 


Re{e’™’}=e-‘cosV3¢ and Imf{e"™}=e7'sin V3 
are real-valued solutions. Consequently, 


y(t)=e'| c,cos V3 t+ ¢,sin V3 ¢] 
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for some choice of constants c,,c,. The constants c, and c, are determined 
from the initial conditions 


l=y(0)=c, 
and 
l=y'(0)=—c,+ V3 ¢. 
This implies that 
2 


a=bas oe and y(t)=e' cos V3 + ——sin V3 #), 


EXERCISES 


Find the general solution of each of the following equations. 


1. ‘2 x44, =0 2. 22% 434 +4y=0 
3. £Y 42% 43y=0 4, sf 4 aye 0 
Solve each of the following initial-value problems. 

5, o7 4S 42205 y(0)=1, y’(0)=—2 

6. £7 42% +5y=0; y(0)=0, y'(0)=2 


7, Assume that b?—4ac <0. Show that 
y(t) =e 2/2t~ 10) cos B (t — to) 


and 


VV 4ac — b? 


ya(t)= e220 sin B(t— 19), B= LAE 
are solutions of (1), for any number fp. 


Solve each of the following initial-value problems. 


d*y dy _ 
8. 27 a t=O YD=L y=! 
d*y_»® 14 <0. 1 Whe — 
9. 37 dt +4y=0; y(2)=1, yQ)= ] 


10. Verify that W[e™ cos Bt, e™ sin Bt] = Be’. 


11. Show that e“” is a complex-valued solution of the differential equation y” + 
w*y =0, Find two real-valued solutions. 


12. Show that (cos¢+isin#)’=cosrt+isinrt. Use this result to obtain the double 
angle formulas sin2¢=2sintcost and cos2t=cost—sin7t. 
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13. Show that 
(cost, + isin’, )(cost, + isint,) =cos(t, + f2) + isin(t, + 4). 
Use this result to obtain the trigonometric identities 
cos(f, + 2) = cost; CoS f, — sinf, SINLy, 
sin(t, + t,) =sin¢, Cost, + Cost, Sin fy. 
14. Show that any complex number a+ ib can be written in the form Ae", where 


= Va*+b* and tand=b/a. 


15. Defining the two possible square roots of a complex number Ae” as 
+ VA e'®/2, compute the square roots of i, 1+i, —i, Vi. 


16. Use Problem 14 to find the three cube roots of i. 
17. (a) Let r,=A+iu be a complex root of r?+(a—1)r+B=0. Show that 
prtie me pryin = preln Om = pcos win t + i sin w In Z] 
is a complex-valued solution of Euler’s equation 


,a* 
t 2 5 ot + By = =0. (*) 
dt 2 
(b) Show that r\cospln¢ and ¢*sinlnz are real-valued solutions of (*). 


Find the general solution of each of the following equations. 


d*y _ & 

27 —_—_— =— 

18, POF +1 +y=0, 1>0 
dy, ® 

19. Oa tt tv =0. t>0 


2.2.2 Equal roots; reduction of order 


If b?=4ac, then the characteristic equation ar*+ br+c=0 has real equal 
roots r,=r,= —b/2a. In this case, we obtain only one solution 


y(t)=e "74 
of the differential equation 


d? 

as +52 +cy=0. (1) 
Our problem is to find a second solution which is independent of y,;. One 
approach to this problem is to try some additional guesses. A second, and 
much more clever approach 1s to try and use our knowledge of y,(¢) to help 
us find a second independent solution. More generally, suppose that we 
know one solution y = y,(t) of the second-order linear equation 


Ll y]= 2 +p()% +q(n)y- 0. (2) 


dt 42 
Can we use this solution to help us find a second independent solution? 
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The answer to this question is yes. Once we find one solution y=y,(t) of 
(2), we can reduce the problem of finding all solutions of (2) to that of 
solving a first-order linear homogeneous equation. This is accomplished by 
defining a new dependent variable v through the substitution 


y(th=y,(t)o(2). 


Then 
y_ iO, a 
dt dt at 
and 
a’y _ ayy YI dv M1 + dv 
d= dt? =~ dt dt *' gf? 
Consequently. 
d* 4 dv W d*v dv 
Lly]=0 +2 Ge tp +p([05) a ta tt 
dy dy dv d* yi 
= — — —* + 
yi7a dt? +|2 dt +001] dt + dt? Fh ators 


_, ao ,[,% 
=i dt? +/25 +04) 4 


since y,(¢) is a solution of L[y]=0. Hence, y(t)=y,(2)v(Z) is a solution of 
(2) if v satisfies the differential equation 


y 22 Oy 
ay 42 
Now, observe that Equation (3) is really a first-order linear equation for 
dv / dt. Its solution is 

+p(t) q 


are S50 


cexp(— f rare} -2f a 


cexp( — { p(dat] 


yi(t) 


dy, 
+)2—4 +01] am (3) 


PPALU) 


(4) 


Since we only need one solution v(t) of (3), we set c=1 in (4). Integrating 
this equation with respect to ¢, and setting the constant of integration equal 
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to zero, we obtain that v(t)= [ u(t) dt, where 


exp( — fro at} 


5 
yi(t) ©) 


u(t)= 
Hence, 


va(t)=o()yi(=ri(d) [uaa (6) 


is a second solution of (2). This solution is independent of y,, for if y,(¢) 
were a constant multiple of y,(¢) then v(¢) would be constant, and conse- 
quently, its derivative would vanish identically. However, from (4) 


exp| — [ro ar} 
at yi(t) 
and this quantity is never zero. 


Remark 1. In writing v(t)= J u(t)dt, we set the constant of integration 


equal to zero. Choosing a nonzero constant of integration would only add 
a constant multiple of y,(t) to y,(t). Similarly, the effect of choosing a con- 
stant c other than one in Equation (4) would be to multiply y,(t) by c. 


Remark 2. The method we have just presented for solving Equation (2) is 
known as the method of reduction of order, since the substitution y(t) = 
y,(t)v(2) reduces the problem of solving the second-order equation (2) to 
that of solving a first-order equation. 


Application to the case of equal roots: In the case of equal roots, we found 
y (=e "/*4 as one solution of the equation 

d*y _,& 
a2 Oe + cy =0. (7) 
We can find a second solution from Equations (5) and (6). It is important 
to realize though, that Equations (5) and (6) were derived under the 


assumption that our differential “_ was written in the form 


7 2 +p()% 7 + a(t)y =0; 
that is, the coefficient of y” was one. In our equation, the coefficient of y” 
is a. Hence, we must divide Equation (7) by a to obtain the equivalent 
equation 

y=0. 


d’y pa 
dt2 2 a dt 
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Now, we can insert p(t)=b/a into (5) to obtain that 


Hence, 
vi(=yi(d) f dt=0,(0) 


is a second solution of (7). The functions y,(t) and y,(¢) are clearly linearly 
independent on the interval — 00 <t< oo. Therefore, the general solution 
of (7) in the case of equal roots is 


y(= ce Ota 4 Cote ~ 28/24 x [e1 +c,t]e— "1/24 


Example 1. Find the solution y(t) of the initial-value problem 
d*y dy 
ener ——_ — ° = ’ 0 = 3, 
7p +47 +4y=0; y(0)=1, y‘(0) 


Solution. The characteristic equation r?+4r+4=(r+2)?=0 has two equal 
roots r,=r,= —2. Hence, 


y(th=c,e7*+c,te~* 


for some choice of constants c,, c,. The constants c, and c, are determined 
from the initial conditions 


1=y(0)=c, 
and 
3=y’(0)= —2c¢,+¢,. 


This implies that c,=1 and c,=5, so that y(Qh=(1+50e7~. 


Example 2. Find the solution y(f) of the initial-value problem 


1-7)? 42% -2y 0) y(0)=3, y'(0)=—4 
on the interval —1<t< 1. 


Solution. Clearly, y,(t)=t is one solution of the differential equation 
17y dy 
— 72 . —I)y= 
(1 —t*)—= 2 7 +21 7 2y =0. (8) 


We will use the method of reduction of order to find a second solution 
y,(t) of (8). To this end, divide both sides of (8) by 1—?¢? to obtain the 
equivalent equation 
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Then, from (5) 


exo(— f 1 5 a) 
1-t en(l-?) 4 72 


bd 


and 


yi(t)=tf Ie d= —i(4+1)=-(142) 


is a second solution of (8). Therefore, 
y(t)=et—c,(1 +2’) 
for some choice of constants c,, c,. (Notice that all solutions of (9) are con- 
tinuous at t= +1 even though the differential equation is not defined at 
these points. Thus, it does not necessarily follow that the solutions of a dif- 
ferential equation are discontinuous at a point where the differential equa- 
tion is not defined—but this is often the case.) The constants c, and c, are 
determined from the initial conditions 
3=y(0)=—c, and —4=y’(0)=c,. 

Hence, y(t)= —4¢+3(1+ 2’). 


EXERCISES 


Find the general solution of each of the following equations 


cy dy _ oY dy _ 
Solve each of the following initial-value « oblems 

d*y ad WAN 
3. 95 +6, ty =0; y(0)=1, y'(0)=0 

d“y 
4.422 4% 4y=0; y=0, yO=3 


dt? 
5. Suppose b?=4ac. Show that 
yi(th=e""C~ 9/24 and y2(t)=(t— tp)e PC 100/24 
are solutions of (1) for every choice of fo. 


Solve the following initial-value problems. 


d*y ad 


6. —5 +2 +y=0; yQ=1, y'(Q=-! 


dz 
d*y dy _Nn. _ 7 an 
7. 9 at a +4y=0; y(7)=0, y’(7)=2 


8. Let a, b and c be positive numbers. Prove that every solution of the differential 
equation ay” + by’ + cy =0 approaches zero as ¢ approaches infinity. 
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9. Here is an alternate and very elegant way of finding a second solution y,(/) of 


(1). 
(a) Assume that b?=4ac. Show that 


L[e"|=a(e")” + b(e") +ce"=a(r—- ren 


for r,; = —b/2a. 
(b) Show that 


(d/dr)L[e” ]= L| (0/dr)e” | = L[te"|]=2a(r—r,)e"+at(r—- rye". 


(c) Conclude from (a) and (b) that L[te””’]=0. Hence, y,(t)= te’ is a second 
solution of (1). 


Use the method of reduction of order to find the general solution of the 
following differential equations. 


10. oe aayop (n(=t+1) 
11. oY 4% sa2- 2)y=0 (y(D=e") 

12. a - 22-24 42y=0 (i (O)=) 

13. a+e42 14 +2y=0 (y,()=d 

14. a- 42-24 s6y=0 (y(t) =302—1) 


15. arte? -40+ 1) & +4y=0 (y,()=t4+1) 


dy, WY fy 1\ _ _ sing 
16. ¢ porta t(P-g)=0 (=) 


17. Given that the equation 
dy 
tT - (143) 2 +3y=0 
has a solution of the form e“, for some constant c, find the general solution. 


18. (a) Show that ¢’ is a solution of Euler’s equation 


t?y” +aty’+ By=0, t>0 
if r7>+(a—1)r+ B=0. 
(b) Suppose that (a—1)*=4£. Using the method of reduction of order, show 
that (In‘)r"—™/2 is a second solution of Euler’s equation. 


Find the general solution of each of the following equations. 


dy dy d*y ad 
24 y ——e = 207 — — 
19. 7 +317 +y=0 20. 1° i +y=0 
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2.3 The nonhomogeneous equation 


We turn our attention now to the en equation 


_ dy 
where the functions p(t), a0 and g(t) are continuous on an open interval 


a<t< 8. An important clue as to the nature of all solutions of (1) is pro- 
vided by the first-order linear equation 


dy 
WV ant (2) 


The general solution of this equation is 


+0004 pt a(t)y=a(t) (1) 


_ 2,1 
y(Q=ce' +35. 


Now, observe that this solution is the sum of two terms: the first term, ce e 
is the general solution of the homogeneous equation 


dy 
aH —2t=0 (3) 


while the second term, 3, is a solution of the nonhomogeneous equation 
(2). In other words, every solution Pa) of (2) is the sum of a particular 
solution, (t)= 5, with a solution ce’ * of the homogeneous equation. A sim- 
ilar situation prevails in the case of second-order equations, as we now 
show. 


Theorem 5. Let y,(t) and y,(t) be two linearly independent solutions of the 
homogeneous equation 
d’y 
Ll y]= or) +p() 2 7 tality =0 (4) 


and let {(t) be any particular solution of the nonhomogeneous equation (1). 
Then, every solution y(t) of (1) must be of the form 


YY=HAUV(YN+Qy(H)+ vs) 
for some choice of constants c,, C9. 


The proof of Theorem 5 relies heavily on the following lemma. 


Lemma 1. The difference of any two solutions of the nonhomogeneous equa- 
tion (1) is a solution of the homogeneous equation (4). 
Proor. Let y,(¢) and y,(¢) be two solutions of (1). By the linearity of L, 
Lh v2 (N=LL JO LL] = 8 (0) — 8(4) =0. 


Hence, y,(¢) — (2) is a solution of the homogeneous equation (4). CO 
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We can now give a very simple proof of Theorem 5. 


PROOF OF THEOREM 5. Let y(/) be any solution of (1). By Lemma 1, the 
function $(t)=y(t)— W(t) is a solution of the homogeneous equation (4). 
But every solution ¢(t) of the homogeneous equation (4) is of the form 
o(t)=c, y(t) +c, y(t), for some choice of constants c,, c,. Therefore, 


V(HN=o() +0) =cy(O+ey2.(t) + ¥(¢). O 


Remark. Theorem 5 is an extremely useful theorem since it reduces the 
problem of finding all solutions of (1) to the much simpler problem of find- 
ing just two solutions of the homogeneous equation (4), and one solution 
of the nonhomogeneous equation (1). 


Example 1. Find the general solution of the equation 
— ty=t. (5) 


Solution. The functions y,(t)=cost and y,(¢)=sin¢ are two linearly inde- 
pendent solutions of the homogeneous equation y” + y =0. Moreover, y(t) 
=f is obviously a particular solution of (5). Therefore, by Theorem 5, 
every solution y(t) of (5) must be of the form 


y(t)=c,cost+c,sin¢ +t. 


Example 2. Three solutions of a certain second-order nonhomogeneous 
linear equation are 


Wit=t, wv(t)=tte', and y,()=ltt+e’. 


Find the general solution of this equation. 
Solution. By Lemma 1, the functions 


b2(t)-v, (=e and ¥,(t)-y,(1)=1 
are solutions of the corresponding homogeneous equation. Moreover, these 
functions are obviously linearly independent. Therefore, by Theorem 5, 
every solution y(t) of this equation must be of the form 


y(th=cye'+c, +t. 


EXERCISES 


1. Three solutions of a certain second-order nonhomogeneous linear equation are 


W(t) — 12, W(t) = t+ et 
and 
W3(t)=14+17+2e7!, 


Find the general solution of this equaiica. 
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2. Three solutions of a certain second-order linear nonhomogeneous equation are 
W()=l+e” W(=1+ te” 
and 
W3(=(t+ le" +1 
Find the general solution of this equation. 
3. Three solutions of a second-order linear equation L[ y]= g(t) are 
W(t) =3e' +e", ¥o(t)=Te' +e" 
and 
W3(t)=Se'teP +e". 
Find the solution of the initial-value problem 
Li[y]J=g; y(O)=1, y'(0)=2. 
4, Let a, b and c be positive constants. Show that the difference of any two solu- 
tions of the equation 
ay” + by’+ cy= g(t) 
approaches zero as t approaches infinity. 


5. Let Y(t) be a solution of the nonhomogeneous equation (1), and let (7) be a 
solution of the homogeneous equation (4). Show that Y(t) + ¢(¢) 1s again a solu- 
tion of (1). 


2.4 The method of variation of parameters 
In this section we describe a very general method for finding a particular 
solution W(t) of the nonhomogeneous aun 


L[ y]= p(y — +4q(t)y=8(2), (1) 


dt 42 
once the solutions of the homogeneous equation 


Lly]= 2 +r) F +a(0y= 0 (2) 


are known. The basic principle of this method is to use our knowledge of 
the solutions of the homogeneous equation to help us find a solution of the 
nonhomogeneous equation. 

Let y,(¢) and y,(t) be two linearly independent solutions of the homoge- 
neous equation (2). We will try to find a particular solution y(t) of the 
nonhomogeneous equation (1) of the form 


v(t) = u,(t)y (2) + un(2)y2(2); (3) 
that is, we will try to find functions u,(¢) and u,(t) so that the linear combi- 
nation u,(t)y,(t) + u,(2)y,(2) is a solution of (1). At first glance, this 
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would appear to be a dumb thing to do, since we are replacing the problem 
of finding one unknown function w(t) by the seemingly harder problem of 
finding two unknown functions u,(t) and u,(t). However, by playing our 
cards right, we will be able to find u,(t) and u(r) as the solutions of two 
very simple first-order equations. We accomplish this in the following 
manner. Observe that the differential equation (1) imposes only one condi- 
tion on the two unknown functions u,(¢) and u,(t). Therefore, we have a 
certain “freedom” in choosing u(t) and u,(t). Our goal is to impose an 
additional condition on u,(t) and u,(t) which will make the expression 
L[u, y,;+ uz y] as simple as possible. Computing 


d d 
avia= dt [ uy, + Uy Vy | 


= [up yit Uy V3 | + [ui yt U5 V2 | 
we see that d7/dt*, and consequently L[wW], will contain no second-order 
derivatives of u, and u, if 


1 (t)uy (t) +y2 (tu, (1) = 0. (4) 
This suggests that we impose the condition (4) on the functions u,() and 
u,(t). In this case, then, 


Liv J=[m ity.) +p(t)[m itm] +a()[u 91+ w272] 
=uyyituyst ul yf +p()yi ta) |] + ml J+ POrt+ aD] 
= UL y+ U2 y2 

since both y,(¢) and y,(t) are solutions of the homogeneous equation L[ y] 

=(). Consequently, Y(t)=u, y, + u,y, is a solution of the nonhomogeneous 

equation (1) if u,(7) and u,(¢) satisfy the two equations 
y(t); (1) +y2 (104 (1) =0 
Yi (t)uy (t) + y(t) U9 (4) = 8 (2). 


Multiplying the first equation by y,(7), the second equation by y,(t), and 
subtracting gives 


Li) — i (Dy2(8) Jui (= —8()¥2(), 
while multiplying the first equation by y;(t), the second equation by y,(2), 
and subtracting gives 


Lilt) a(t) — yi (2) y2(t) ws (N= 8(O yi (0). 
Hence, 


ry 8) Y2(4) a SoZ 
nn) WI ypy2 |(2) ang (9) WI yi¥2](t) 


Finally, we obtain u,(¢) and u,(¢) by integrating the right-hand sides of (5). 


(5) 
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Remark. The general solution of the homogeneous equation (2) is 


y(t)=c, y(t) +c, y2(t). 
By letting c, and c, vary with time, we obtain a solution of the nonhomo- 


geneous equation. Hence, this method is known as the method of variation 
of parameters. 


Example 1. 

(a) Find a particular solution y(t) of the equation 
dy 
—-+y=tant 6 
Wty (6) 


on the interval —7/2<t<7/2. 

(b) Find the solution y(¢) of (6) which satisfies the initial conditions y(0)= 
I, y'(0)=1. 

Solution. 

(a) The functions y,(¢)=cost and y,(¢)=sin¢ are two linearly independent 
solutions of the homogeneous equation y” + y =0 with 


Wy ¥2 (4) =¥1¥2 172 = (cost)cost —(—sint)sint=1. 
Thus, from (5), 
ui(¢)=—tantsint and uw5(t)=tantcost. (7) 


Integrating the first equation of (7) gives 


42 
u,(1)=— f tantsinede=— [SOF 


24 
= (SS 4— 1 a =sint —In\sect + tant, 
cost 


=sint—In(sect+tant), — 5 < t<z 


while integrating the second equation of (7) gives 


un(1)= f tantcosrdt= f sintdt = — cost. 


Consequently, 
W(t) =cost| sint—In(sect + tan‘) | + sint(—cos‘) 
= —cosfIn(sec¢+ tant) 


is a particular solution of (6) on the interval —7/2<t<7/2. 
(b) By Theorem 5 of Section 2.3, 


y(t) =c,cost+c,sint—costln(sect + tant) 


for some choice of constants c,, c,. The constants c, and c, are determined 
from the initial conditions 


l=y(0)=c, and 1=y’(0)=c,-—1. 
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Hence, c, =1, c,=2 and 
y(t) =cost+2sint—costln(secr+ tant). 


Remark. Equation (5) determines u,(t) and u,(t) up to two constants of in- 
tegration. We usually take these constants to be zero, since the effect of 
choosing nonzero constants is to add a solution of the homogeneous equa- 
tion to (fz). 


EXERCISES 
Find the general solution of each of the following equations. 


d*y 


1. We 12 8h ~F<I<F 
2. FY 4D 4 ay =e 

3. fF 3B 4 ya(et de 
4, oY 3% sry ete 


Solve each of the following initial-value problems. 
5, 3y”’+4y’+y=(sinte‘; y(O)=1, y’(0)=0 

6. yy" +4y'+4y = 19/7%e-%; yO) =y'(0)=0 

7. y”—3y'+2y=ViIt1; y(0)=y'(0)=0 

8. y"-y=f(); yO)=y'(0)=0 


Warning. It must be remembered, while doing Problems 3 and 5, that 
Equation (5) was derived under the assumption that the coefficient of y” 
was one. 


9, Find two linearly independent solutions of t“y” —2y=0 of the form y()=1’. 
Using these solutions, find the general solution of t?7y” —2y =2?. 


10. One solution of the equation 
y"+p(t)y'+q(t)y =0 (*) 


is (1+ 12)*, and the Wronskian of any two solutions of (*) is constant. Find the 
general solution of 


y"+p(t)y+q(ty=l+e. 


11. Find the general solution of y” +(1/427)y =fcost, t>0, given that y,())= Vt 
is a solution of the homogeneous equation. 
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12. Find the general solution of the equation 
d*y 24 w& 2 


Oy * ve 2 
dw tapad?yypr tt: 


13. Show that sect+ tant is positive for —7/2<t<a/2. 


2.5 The method of judicious guessing 


A serious disadvantage of the method of variation of parameters is that the 
integrations required are often quite difficult. In certain cases, it is usually 
much simpler to guess a particular solution. In this section we will establish 
a systematic method for guessing solutions of the equation 


d’y _,.& 
a2 +b + cy=2(t) (1) 


where a, b and c are constants, and g(t) has one of several special forms. 
Consider first the differential equation 


dy _,® n 
Ll y]=a7 7 tb ta dot att... + anf . (2) 


We seek a function y(t) such that the three functions ay”, by’ and cy add 
up to a given polynomial of degree n. The obvious choice for y(t) is a 
polynomial of degree n. Thus, we set 


V(t)=AQtA tt... +A4,t” (3) 
and compute 
L[ ¥](t) = ay" (1) + by'(1) + ob (1) 
=al2A,+...+n(n—1)A,t"-7]+b[ A, +... +4,t"-"] 
+c[ Ag+ Ajtt+...+A,t"] 
=cA,t"+(cA,_,+nbA,)t"~'+...+(cAg+bA,+2aA,). 
Equating coefficients of like powers of ¢ in the equation 
Lil p](t)=agtatt...+a,t" 
gives 
cA, =4a,,CA,_,+nbA,=4a,_,...,CAgt+ bA, +2aA,= do. (4) 
The first equation determines A, =a,/c, for c#0, and the remaining equa- 
tions then determine A,_,,...,A 9 successively. Thus, Equation (1) has a 
particular solution ¥(7) of the form (3), for c+0. 
We run into trouble when c=0, since then the first equation of (4) has 


no solution A,. This difficulty is to be expected though, for if c=0, then 
L{y]= ap” + by’ is a polynomial of degree n—1, while the right hand side 
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of (2) is a polynomial of degree n. To guarantee that ay” + by’ is a poly- 
nomial of degree n, we must take w as a polynomial of degree n+ 1. Thus, 
we set 


W(t)=tl Agt Att... +A,t"]. (5) 


We have omitted the constant term in (5) since y =constant is a solution of 
the homogeneous equation ay” + by’=0, and thus can be subtracted from 
W(t). The coefficients Ay, A,,...,4, are determined uniquely (see Exercise 
19) from the equation 


ap” + by’ =agtat+...+a,t" 


if b#0. 

Finally, the case b=c=0 1s trivial to handle since the differential equa- 
tion (2) can then be integrated immediately to yield a particular solution 
W(t) of the form 


()= fot 4 ae at a,t"*? 
v 2: 37 * G4 ln 42) 
Summary. The differential equation (2) has a solution y(f) of the form 
AptA tt... +A,t", c#0 
W(th=i t(AgtAytt...+A,0"),  c=0,b40, 
P(AyptA tt... +41"), c=b=0 


Example 1. Find a particular solution (t) of the equation 
d*y | dy 
L| y|=—> —+y=/7’, (6) 


Solution. We set p(t)=A,)+A,t+A,t* and compute 
Liv ](Q=v" (a +y() + H(t) 
=24,+(A,+24,1)+ A +A tt+A,0? 
=(A,+A,+2A,)+(A,+24,)t+ A,0?. 
Equating coefficients of like powers of ¢ in the equation L[W](t)=?° gives 
A,=1, A,+2A,=0 
and 
A t+A,+2A,=0. 


The first equation tells us that 4,=1, the second equation then tells us that 
A,=-—2, and the third equation then tells us that A7=0. Hence, 


W(t)=—2t+2? 


is a particular solution of (6). 
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Let us now re-do this problem using the method of variation of parame- 
ters. It is easily verified that 
yi(t)=e cos V3 t/2 and y,(t)=e ‘sin V3 t/2 
are two solutions of the homogeneous equation L[ y]=0. Hence, 
W(t)=u,(te~'/*cos V3 t/2+ u,(t)e~/*sin V3 t/2 


is a particular solution of (6), where 


—pe-t/ ‘sin V3 ¢/2 _ -2 t/des 
u,(t)= I—Wooua 4 t= ve 2sin V3 t/2adt 


and 


t7e~'/2 cos V3 t/2 
u,(t)= [eae Je e'/2cos V3 t/2dt. 
WL y2](2) 
These integrations are extremely difficult to perform. Thus, the method of 
guessing is certainly preferrable, in this problem at least, to the method of 
variation of parameters. 
Consider now the differential equation 
d*y dy n\ , at 

Lil y|= as tba tea (agt att... +a ye™. (7) 
We would like to remove the factor e® from the right-hand side of (7), so 
as to reduce this equation to Equation (2). This is accomplished by setting 
y(t) =e™v(t). Then, 


y=e"(v'tav) and y”=e% (v” +2av’+a’v) 
so that 

LL y]=e™| av” + (2aa + b)v' + (aa? + ba + c)o]. 
Consequently, y(t) = ev(Z) is a solution of (7) if, and only if, 


a2? + (200+ 6) 2 + (aa? +batcw=aytatt...ta,t". (8) 


In finding a particular solution v(t) of (8), we must distinguish as to 
whether (i) aa*+ ba+c#0; (ii) aa?+ ba+c=0, but 2aa+b¥<0; and (iii) 
both aa*+ ba+c and 2aa+ b=0. The first case means that a is not a root 
of the characteristic equation 


ar? + br+c=0. (9) 


In other words, e™ is not a solution of the homogeneous equation L[ y]=0. 
The second condition means that a is a single root of the characteristic 
equation (9). This implies that e® is a solution of the homogeneous equa- 
tion, but te“ is not. Finally, the third condition means that a is a double 
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root of the characteristic equation (9), so that both e™ and te™ are solu- 
tions of the homogeneous equation. Hence, Equation (7) has a particular 
solution W(t) of the form (i) ¥(t)=(Apt+... +4, t”)e™, if e™ is not a solu- 
tion of the homogeneous equation; (ii) Y(t) =t(Apt+... +A, t”)e™, if e™ is 
a solution of the homogeneous equation but te“ is not; and (iii) Y(t)= 
t(Ag+...+A,t")e™ if both e® and te“ are solutions of the homogeneous 
equation. 


Remark. There are two ways of computing a particular solution y/() of (7). 
Either we make the substitution y = e“v and find v(t) from (8), or we guess 
a solution y(t) of the form e® times a suitable polynomial in ¢. If a is a 
double root of the characteristic equation (9), or if n >2, then it is advis- 
able to set y=ev and then find v(t) from (8). Otherwise, we guess y(t) 
directly. 


Example 2. Find the general solution of the equation 


dy ,& 
= +4y=(l+rt+...+2?")e% 10 
G2 at y=(1+r+ Je (10) 
Solution. The characteristic equation r?>—4r+4=0 has equal roots r,; =r, 
=2. Hence, y,(‘)=e~ and y,(t)=te*! are solutions of the homogeneous 
equation y” —4y’+4y =0. To find a particular solution y(t) of (10), we set 
y=ey, Then, of necessity, 
d*v 
=l+tte?+...+27", 
dt? 
Integrating this equation twice, and setting the constants of integration 
equal to zero gives 
\= 72 fi 3 Po 
(Y= Ty + 73 FF 98-29" 
Hence, the general solution of (10) is 


—_ 2t 2t 2t a {? 

y(tHh=cye" + cyte" +e me 78. 8 | 
129 

=etleteytt tat ay 25-25 | 


It would be sheer madness (and a terrible waste of paper) to plug the ex- 
pression 
W(th=0?(AgHAtt ... + Ant? )e™ 


into (10) and then solve for the coefficients Ap, A,,...,A47. 


Example 3. Find a particular solution y(¢) of the equation 


d*y dy _ 3t 
Ll y|= 7 37 +2y=(1+t te , 
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ae In this case, e*’ is not a solution of the homogeneous equation 
—3y’+2y=0. Thus, we set Y(t) =(A,+ A ,fe*. Computing 


L[W]()= 9" —3y' +24 
= e*| (949 +64, +9A jt) —3(3A9+ A, +3A jt) +2(Ay+A,1) | 
= e*| (24) +341) +240] 
and cancelling off the factor e*’ from both sides of the equation 
Liv] =(14e%, 
gives 
2A ,t+(2A,+34,)=1 41. 
This implies that 24,=1 and 24,+34,=1. Hence, A,=+, Ag=—j and 
W(n=(-—F+4/2)e™. 
Finally, we consider the differential equation 


d*y dy COS wt 
Ll y|= a> +b 7 +=(atatt.. ta,t")x{ ¢ tet (11) 


We can reduce the problem of finding a particular solution (2) of (11) to 
the simpler problem of finding a particular solution of (7) with the aid of 
the following simple but extremely useful lemma. 


Lemma 1. Let y(t)= u(t) + iv(t) be a complex-valued solution of the equa- 
tion 


Ly]=a2% +64 + or=a()=a(+ eal (12) 


where a, b and c are real. This means, of course, that 


al u’(t)+ iv” (t) | + b| u'(t)+ iv’(t) | +c[ u(t)+ iv(t) | =¢1(t)+ig,(t). 


(13) 
Then, L[u)(t)= g(t) and L[v](t)=g,(?). 
ProoF. Equating real and imaginary parts in (13) gives 
au” (t)+ bu'(t) + cu(t)=g,(t) 
and 
av” (t) + bu’(t) + cv(t) = g, (7). O 


Now, let $(¢)= u(t)+ iv(t) be a particular solution of the equation 


d*y dy nN\ plw 
aa +b + cy =(ayt+...+a,t”)e™. (14) 


The real part of the right-hand side of (14) is (ayg+ ... +a,t”)coswt, while 
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the imaginary part is (ag+... +a4,/")sinwt. Hence, by Lemma 1 
u(t)=Re{o(t)} 
is a solution of 
ay” + by’+cy =(agt... ta,t") coswt 
while 
o(t)=Im{o(¢)} 
is a solution of 
ay" + by'+ cy =(ayot+... ta,t”)sinwt. 


Example 4. Find a particular solution y(t) of the equation 
y ; 
L = — +4y=sin2t. 15 


Solution. We will find Y(t) as the imaginary part of a complex-valued solu- 
tion ¢(t) of the equation 
d*y 
Ll y | dt? 
To this end, observe that the characteristic equation r?+4=0 has complex 
roots r= +2i. Therefore, Equation (16) has a particular solution @(t) of the 
form $(t)=A,te*”. Computing 


+4y =e", (16) 


o'(t)=A,(1+2it)e™ and ”(t)=A,(4i-4t)e*" 
we see that 


L[ $|(t)=9"(t) +49(t) =4iAye™. 
Hence, Ag=1/4i= —i/4 and 
—— tow _ it ‘sindr)= b sindr—-it 
o(t) 4° 4 (cos2¢+ isin2r) 4 sin 2t i4 cos 2t. 
Therefore, y(t) =Im{¢$(t)} = —(¢/4)cos2t is a particular solution of (15). 
Example 5. Find a particular solution y(t) of the equation 


dy 
— +4y=cos2r. 17) 
pry ( 


Solution. From Example 4, $(t)=(t/4)sin2t—i(t/4)cos2¢ is a complex- 
valued solution of (16). Therefore, 


W(t) =Re{¢(t)} = ri sin 2t 
is a particular solution of (17). 
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Example 6. Find a particular solution W(t) of the equation 


d’y 4B ' 
Ly|=—7 3 ta +y=te'cost. (18) 
Solution. Observe that te‘ cost is the real part of te“'+". Therefore, we can 
find y(t) as the real part of a complex-valued solution ¢$(t) of the equation 


d’y _.& 
Ly]=—3 +2 +y=telt ye (19) 


To this end, observe that 1+/i is not a root of the characteristic equation 
r?+2r+1=0. Therefore, Equation (19) has a particular solution #(f) of the 
form $(t)=(A,+A,te"*. Computing L[¢]=o” +2¢’+¢, and using the 
identity 


(1+? 4201+) 4+1=[(1+a41] =(2+3)7 
we see that 
[(2+ i) Att (2+ i) Ag+ 2(2+ iA, | =F. 
Equating coefficients of like powers of ¢ in this equation gives 
(2+i)A,=1 
and 
(2+ i)Agt+2A,=0. 
This implies that A, =1/(2+i)? and Ay= —2/(2+ 1), so that 


—2 4 1 


(2+i)  (2+i) 


(1+ a)t 


ée 


o(t)= 


After a little algebra, we find that 
e! , 
o()= Ts { | (15t—4)cos¢+ (204 —22)sin | 


+ i[ (22—201)cost + (15t—4)sinr]}. 
Hence, 


W(t) =Re{o(t)} = < [ (15t—4)cosz + (202 —22)sint |. 


Remark. The method of judicious guessing also applies to the equation 


d*y dy n 
Ll y]=a—-5 tba t+ay= > pte (20) 
at u j=l 


where the p,(t),j=1,...,2 are polynomials in ¢. Let y(t) be a particular 
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solution of the equation 
LI y| =p(the™",  j=l,...yn. 
Then, Y(t) = Z7_,¥;,(2) is a solution of (20) since 
Su]= 3 elude 3 ninew 
j=) j=l j=) 


Thus, to find a particular solution of the equation 


Lil y|=L 


y"+y’+y=e'+tsint 
we find particular solutions y,(¢) and y,(¢) of the equations 
y"+y' +y=e’ and y”"+y’+y=tsint 


respectively, and then add these two solutions together. 


EXERCISES 


Find a particular solution of each of the following equations. 


1 y’+3y=P-1 2. y" +4y'+4y = te! 

3. y"-y=t'e! 4. y"+y'ty=14tt4+7 

5. y"+2y’+y=e' 6. y" +5y'+4y= te” 

7. yy" +4y=rsin2¢ 8. y” —6y’+ 9y =(3t’ —5S5r*)e* 

9, y” —2y’+5y =2cos7F 10. y” —2y’ + 5y =2(cos?r)e! 

11. y”+y’—6y =sint + te?! 12. yp’ +y'+4y = 17+ (2¢4+3)(1+cos2) 
13. y”—3y’+2y=e' +e” 14. y"+2y’=14+7+e7 

15. y” +y =costcos2t 16. y” + y =costcos2tcos3rt. 


17, (a) Show that cos’wt = 1Re{e7" + 3e}, 
Hint: coswt =(e' + e~ )/2, 
(b) Find a particular solution of the equation 


10y” +0.2y’ + 1000y = 5 + 20cos? 102 
18. (a) Let L[y]=y” —2r, y’+r,’y. Show that 
L[| e™v(t) | =e"'"(2). 
(b) Find the general solution of the equation 
y" _ 6y’ + 9y = 3/2638, 


19. Let Y(t)=t(Ap+...+A,t”), and assume that b0. Show that the equation 
ay” + by’ =agt+...+a,t”" determines Ap,...,A, uniquely. 
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2.6 Mechanical vibrations 


Consider the case where a small object of mass m is attached to an elastic 
spring of length /, which is suspended from a rigid horizontal support (see 
Figure 1). (An elastic spring has the property that if it is stretched or com- 
pressed a distance A/ which is small compared to its natural length /, then 
it will exert a restoring force of magnitude kA/. The constant k is called 
the spring-constant, and is a measure of the stiffness of the spring.) In 
addition, the mass and spring may be immersed in a medium, such as oil, 
which impedes the motion of an object through it. Engineers usually refer 
to such systems as spring-mass-dashpot systems, or as seismic instruments, 
since they are similar, in principle, to a seismograph which 1s used to detect 
motions of the earth’s surface. 


Figure 1 


Spring-mass-dashpot systems have many diverse applications. For ex- 
ample, the shock absorbers in our automobiles are simple spring-mass- 
dashpot systems. Also, most heavy gun emplacements are attached to such 
systems so as to minimize the “recoil” effect of the gun. The usefulness of 
these devices will become apparent after we set up and solve the differen- 
tial equation of motion of the mass m. 

In calculating the motion of the mass m, it will be convenient for us to 
measure distances from the equilibrium position of the mass, rather than 
the horizontal support. The equilibrium position of the mass is that point 
where the mass will hang at rest if no external forces act upon it. In 
equilibrium, the weight mg of the mass is exactly balanced by the restoring 
force of the spring. Thus, in its equilibrium position, the spring has been 
stretched a distance A/, where k Al= mg. We let y=O denote this equi- 
librium position, and we take the downward direction as positive. Let y(¢) 
denote the position of the mass at time ¢. To find y(t), we must compute 
the total force acting on the mass m. This force is the sum of four separate 
forces W, R, D and F. 
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(i) The force W=mg_g is the weight of the mass pulling it downward. 
This force is positive, since the downward direction is the positive y direc- 
tion. 

(11) The force R is the restoring force of the spring, and it is proportional 
to the elongation, or compression, A/+y of the spring. It always acts to re- 
store the spring to its natural length. If A/+y>0, then R is negative, so 
that R= —k(A/+y), and if AJ/+y <0, then R 1s positive, so that R= 
—k(Al+y). In either case, 


R=—k(Al+y). 


(111) The force D is the damping, or drag force, which the medium exerts 
on the mass m. (Most media, such as oil and air, tend to resist the motion 
of an object through it.) This force always acts in the direction opposite the 
direction of motion, and is usually directly proportional to the magnitude 
of the velocity dy /dt. If the velocity is positive; that is, the mass is moving 
in the downward direction, then D= —cdy/dt, and if the velocity is nega- 
tive, then D= —cdy/dt. In either case, 


D=—cdy/dt. 


(iv) The force F is the external force applied to the mass. This force is 
directed upward or downward, depending as to whether F is positive or 
negative. In general, this external force will depend explicitly on time. 

From Newton’s second law of motion (see Section 1.7) 

2 


d*y 
m—=W+R+D+4+F 
dt” 


= mg — k(Al+y) 0 + F(t) 


—ky-c af + F(t), 
since mg=k Al. Hence, the position 40 of the mass satisfies the second- 
order linear differential equation 

d*y _ ad 

ma te7 thy= F(t) (1) 

where m, c and k are nonnegative constants. We adopt here the mks system 
of units so that F is measured in newtons, y is measured in meters, and f is 
measured in seconds. In this case, the units of k are N/m, the units of c are 
N-s/m, and the units of m are kilograms (N-s?/m) 


(a) Free vibrations: 


We consider first the simplest case of free undamped motion. In this case, 
Equation (1) reduces to 


mi? d*y 
m— thy= =0 or Gr tr =0 (2) 
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where w,=k/m. The general solution of (2) is 
y(t) =acoswot + bsin wo. (3) 


In order to analyze the solution (3), it is convenient to rewrite it as a single 
cosine function. This is accomplished by means of the following lemma. 


Lemma 1. Any function y(t) of the form (3) can be written in the simpler 
form 
y(t) = Reos(wot — 6) (4) 
where R= Va?+b? and &=tan~'b/a. 


Proor. We will verify that the two expressions (3) and (4) are equal. To 
this end, compute 


Rcos(wot — 6) = Reoswot cosd + R sin wot sind 
and observe from Figure 2 that Rcosé=a and Rsinéd=b. Hence, 


R cos(Wot — 6) = a COS Wot + bsin wot. O 


v 
00) 
E 


A 


e 
Figure 2 


In Figure 3 we have graphed the function y = Rcos(wot — 6). Notice that 
y(t) always lies between — R and + R, and that the motion of the mass is 
periodic—it repeats itself over every time interval of length 27/w . This 
type of motion is called simple harmonic motion; R is called the amplitude 
of the motion, 6 the phase angle of the motion, 7)=27/w, the natural 
period of the motion, and w= Vk/m the natural frequency of the sys- 
tem. 


(b) Damped free vibrations: 


If we now include the effect of damping, then the differential equation 
governing the motion of the mass is 


d*y 
arr) tc +ky =0. (5) 
The roots of the characteristic equation mr2+ cr+k=0 are 
aet Vici ~4km | ae7 Vet = 4km_ 


r= i and r,= am 
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Figure 3. Graph of y(t)= Rcos (wot — 6) 


Thus, there are three cases to consider, depending as to whether c?—4km 
Is positive, negative or zero. 
(i) c?—4km>0. In this case both r, and r, are negative, and every solu- 
tion y(t) of (5) has the form 
y(t)=ae"™ + be™, 
(ii) c?—4km=0. In this case, every solution y(f) of (5) is of the form 
y(t)=(at btje~%/2", 
(iii) c?7—4km <0. In this case, every solution y(t) of (5) is of the form 
1 / a2 
y(t)=e7%/?"| acosptt bsin pr], p= ee 


The first two cases are referred to as overdamped and critically damped, 
respectively. They represent motions in which the originally displaced mass 
creeps back to its equilibrium position. Depending on the initial condi- 
tions, it may be possible to overshoot the equilibrium position once, but no 
more than once (see Exercises 2-3). The third case, which is referred to as 
an underdamped motion, occurs quite often in mechanical systems and 
represents a damped vibration. To see this, we use Lemma 1 to rewrite the 
function 


y(th=e7%/?™| acospt + bsin ut] 
in the form 
y(t) = Re~“/?" cos( ut— 8). 


The displacement y oscillates between the curves y= + Re~“/*", and thus 

represents a cosine curve with decreasing amplitude, as shown in Figure 4. 
Now, observe that the motion of the mass always dies out eventually if 

there is damping in the system. In other words, any initial disturbance of 
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Figure 4. Graph of Re~“/?" cos( pt — 6) 


the system is dissipated by the damping present in the system. This is one 
reason why spring-mass-dashpot systems are so useful in mechanical sys- 
tems: they can be used to damp out any undesirable disturbances. For ex- 
ample, the shock transmitted to an automobile by a bump in the road 1s 
dissipated by the shock absorbers in the car, and the momentum from the 
recoil of a gun barrel is dissipated by a spring-mass-dashpot system 
attached to the gun. 


(c) Damped forced vibrations: 


If we now introduce an external force F(¢)= Fycoswt, then the differential 
equation governing the motion of the mass is 


d*y _ & 
m3 ter + ky = Fycosut. (6) 


Using the method of judicious guessing, we can find a particular solution 
W(t) of (6) of the form 


Fy 


(k— mw?) + ¢7w? 


¥(1)= 


[ (kK — mw’) coswt + cwsin wt | 


F 1/2 
=——— [(k- mu*) + c%w*| " cos(wt ~ 8) 


(k- mw?) + 07 
Fy cos(wt — 6 ) 
“TO 2 a a2 ™) 
| (x — mw) + cu? | 
where tand=cw/(k— mw’). Hence, every solution y(t) of (6) must be of 
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the form 
Fy cos(wt — 6) 


y(tH=O(1) + Y()=o() + ; 72 (8) 
| (A —mw’) + 70 | 
where ¢(f) is a solution of the homogeneous equation 
d*y _ w& 
me te +ky =0. (9) 


We have already seen though, that every solution y = (ft) of (9) approaches 
zero as t approaches infinity. Thus, for large ¢, the equation y(¢)=y(f) de- 
scribes very accurately the position of the mass m, regardless of its initial 
position and velocity. For this reason, y(f) is called the steady state part of 
the solution (8), while $(f) is called the transient part of the solution. 


(d) Forced free vibrations: 


We now remove the damping from our system and consider the case of 
forced free vibrations where the forcing term is periodic and has the form 
F(t)= F,coswt. In this case, the differential equation governing the motion 
of the mass m is 


d*y Fy 
Gi + 80Y = 7 Cost, we=k/m. (10) 
The case ww, 1S uninteresting; every solution y(t) of (10) has the form 
F 
y(t) =c, CoS wot + c, Sin Wot + STS COS wt, 
m(wo - w*) 


and thus is the sum of two periodic functions of different periods. The in- 
teresting case is when w=w,; that is, when the frequency w of the external 
force equals the natural frequency of the system. This case is called the res- 
onance case, and the differential equation of motion for the mass m is 
d’y Fo 
— +upy = — coswof. 1] 
A ae 0 (11) 


We will find a particular solution y(t) of (11) as the real part of a 
complex-valued solution ¢(t) of the equation 


Fo IWot 
Ta tHy=— ene. (12) 


Since e“* is a solution of the homogeneous equation y” +wy =0, we 
know that (12) has a particular solution $(t)= Ate’*”, for some constant A. 
Computing 


b” + who = iw Ae?! 
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we see that 
1 Fo —IFo 
7 iw, nm 2MwWo 
Hence, 
— iF ot 
o(t)= Ime, (COS Wot + i SiN wWof) 
Fot _ Fot 
= Imo SIN Wol — "Imo COS Wol 
is a particular solution of (12), and 
Fot 
W(t) =Re{¢(t)}= Imus, SIN Wof 


is a particular solution of (11). Consequently, every solution y(t) of (11) is 
of the form 


Fot 
Y(t) HC, COS Wot + Cy SIN Wot + Tos ieo! (13) 


for some choice of constants c),¢>. 

Now, the sum of the first two terms in (13) is a periodic function of 
time. The third term, though, represents an oscillation with increasing am- 
plitude, as shown in Figure 5. Thus, the forcing term Fy)coswt, if it is in res- 
onance with the natural frequency of the system, will always cause un- 
bounded oscillations. Such a phenomenon was responsible for the collapse 
of the Tacoma Bridge, (see Section 2.6.1) and many other mechanical 
catastrophes. 


y 


Figure 5. Graph of f(t) = Atsinwot 
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EXERCISES 


1. 


It is found experimentally that a 1 kg mass stretches a spring 49/320 m. If the 
mass is pulled down an additional 1/4 m and released, find the amplitude, 
period and frequency of the resulting motion, neglecting air resistance (use 
g=9.8 m/s’). 


. Let y(t)= Ae’ + Be™, with |A|+|B| +40. 


(a) Show that y(t) is zero at most once. 
(b) Show that y’(f) is zero at most once. 


. Let y(t) =(A + Boe”, with |A|+|B| <0. 


(a) Show that y(t) is zero at most once. 
(b) Show that y’(f) is zero at most once. 


. A small object of mass 1 kg is attached to a spring with spring constant 2N /m. 


This spring—mass system is immersed in a viscous medium with damping 
constant 3 N-s/m. At time t=0, the mass is lowered 1/2 m_ below its 
equilibrium position, and released. Show that the mass will creep back to its 
equilibrium position as ¢ approaches infinity. 


. A small object of mass 1 kg is attached to a spring with spring-constant | N/m 


and is immersed in a viscous medium with damping constant 2 N-s/m. At time 
t= 0, the mass is lowered 1/4 m and given an initial velocity of 1 m/s in the 
upward direction. Show that the mass will overshoot its equilibrium position 
once, and then creep back to equilibrium. 


. A small object of mass 4 kg is attached to an elastic spring with spring-constant 


64 N/m, and is acted upon by an external force F(t) = A cos?wt. Find all values 
of w at which resonance occurs. 


. The gun of a U.S. M60 tank is attached to a spring—mass—dashpot system with 


spring-constant 100a? and damping constant 200a, in their appropriate units. 
The mass of the gun is 100 kg. Assume that the displacement y(t) of the gun 
from its rest position after being fired at time t=0 satisfies the initial-value 
problem 


100y’”+200ay’+ 100a*y =0; y(0) =0, y’(0) =100 m/s. 


It is desired that one second later, the quantity y” +(y’)” be less than .01. How 
large must a be to guarantee that this is so? (The spring—mass—dashpot 
mechanism in the M60 tanks supplied by the U.S. to Israel are critically 
damped, for this situation is preferable in desert warfare where one has to fire 
again as quickly as possible). 


. A spring—mass—dashpot system has the property that the spring constant k is 9 


times its mass m, and the damping constant c is 6 times its mass. At time ¢ = 0, 
the mass, which is hanging at rest, is acted upon by an external force F(t) = 
(3sin3t) N. The spring will break if it is stretched an additional 5 m from its 
equilibrium position. Show that the spring will not break if m>1/5 kg. 


. A spring—mass—dashpot system with m=1, k =2 and c= 2 (in their respective 


units) hangs in equilibrium. At time ¢ = 0, an external force F(t) =a —tN acts 
for a time interval 7. Find the position of the mass at anytime ¢> 7. 
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10. A 1 kg mass is attached to a spring with spring constant k = 64 N/m. With the 
mass on the spring at rest in the equilibrium position at time t= 0, an external 
force F(t) = (41) N is applied until time t, = 77/16 seconds, at which time it is 
removed. Assuming no damping, find the frequency and amplitude of the 
resulting oscillation. 


11. A 1 kg mass is attached to a spring with spring constant k = 4 N/m, and hangs 
in equilibrium. An external force F(t) =(1+¢t-+sin2t) N is applied to the mass 
beginning at time ¢=0. If the spring is stretched a length (1/2+ 7/4) m or 
more from its equilibrium position, then it will break. Assuming no damping 
present, find the time at which the spring breaks. 


12. A small object of mass 1 kg is attached to a spring with spring constant k =1 
N/m. This spring-mass system is then immersed in a viscous medium with 
damping constant c. An external force F(t)=(3—cost) N is applied to the 
system. Determine the minimum positive value of c so that the magnitude of the 
steady state solution does not exceed 5 m. 


13. Determine a particular solution Y(t) of my” +cy’+ ky = Focoswt, of the form 
W(t)= A cos(wt—). Show that the amplitude 4 is a maximum when w?=«} 
—1(c/m)*. This value of w is called the resonant frequency of the system. What 


happens when wé < }(c/m)*? 


2.6.1 The Tacoma Bridge disaster 


On July 1, 1940, the Tacoma Narrows Bridge at Puget Sound in the state 
of Washington was completed and opened to traffic. From the day of its 
opening the bridge began undergoing vertical oscillations, and it soon was 
nicknamed “Galloping Gertie.” Strange as it may seem, traffic on the 
bridge increased tremendously as a result of its novel behavior. People 
came from hundreds of miles in their cars to enjoy the curious thrill of 
riding over a galloping, rolling bridge. For four months, the bridge did a 
thriving business. As each day passed, the authorities in charge became 
more and more confident of the safety of the bridge—so much so, in fact, 
that they were planning to cancel the insurance policy on the bridge. 
Starting at about 7:00 on the morning of November 7, 1940, the bridge 
began undulating persistently for three hours. Segments of the span were 
heaving periodically up and down as much as three feet. At about 10:00 
a.m., something seemed to snap and the bridge began oscillating wildly. At 
one moment, one edge of the roadway was twenty-eight feet higher than 
the other; the next moment it was twenty-eight feet lower than the other 
edge. At 10:30 a.m. the bridge began cracking, and finally, at 11:10 a.m. 
the entire bridge came crashing down. Fortunately, only one car was on 
the bridge at the time of its failure. It belonged to a newspaper reporter 
who had to abandon the car and its sole remaining occupant, a pet dog, 
when the bridge began its violent twisting motion. The reporter reached 
safety, torn and bleeding, by crawling on hands and knees, desperately 
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clutching the curb of the bridge. His dog went down with the car and the 
span—the only life lost in the disaster. 

There were many humorous and ironic incidents associated with the col- 
lapse of the Tacoma Bridge. When the bridge began heaving violently, the 
authorities notified Professor F. B. Farquharson of the University of 
Washington. Professor Farquharson had conducted numerous tests on a 
simulated model of the bridge and had assured everyone of its stability. 
The professor was the last man on the bridge. Even when the span was tilt- 
ing more than twenty-eight feet up and down, he was making scientific ob- 
servations with little or no anticipation of the imminent collapse of the 
bridge. When the motion increased in violence, he made his way to safety 
by scientifically following the yellow line in the middle of the roadway. 
The professor was one of the most surprised men when the span crashed 
into the water. 

One of the insurance policies covering the bridge had been written by a 
local travel agent who had pocketed the premium and had neglected to re- 
port the policy, in the amount of $800,000, to his company. When he later 
received his prison sentence, he ironically pointed out that his embezzle- 
ment would never have been discovered if the bridge had only remained 
up for another week, at which time the bridge officials had planned to 
cancel all of the policies. 

A large sign near the bridge approach advertised a local bank with the 
slogan “as safe as the Tacoma Bridge.” Immediately following the collapse 
of the bridge, several representatives of the bank rushed out to remove the 
billboard. 

After the collapse of the Tacoma Bridge, the governor of the state of 
Washington made an emotional speech, in which he declared “We are 
going to build the exact same bridge, exactly as before.” Upon hearing 
this, the noted engineer Von Karman sent a telegram to the governor stat- 
ing “If you build the exact same bridge exactly as before, it will fall into 
the exact same river exactly as before.” 

The collapse of the Tacoma Bridge was due to an aerodynamical phe- 
nomenon known as Stall flutter. This can be explained very briefly in the 
following manner. If there is an obstacle in a stream of air, or liquid, then 
a “vortex street” is formed behind the obstacle, with the vortices flowing 
off at a definite periodicity, which depends on the shape and dimension of 
the structure as well as on the velocity of the stream (see Figure 1). As a 
result of the vortices separating alternately from either side of the obstacle, 
it is acted upon by a periodic force perpendicular to the direction of the 
stream, and of magnitude Fy)coswt. The coefficient Fy depends on the 
shape of the structure. The poorer the streamlining of the structure; the 
larger the coefficient Fj, and hence the amplitude of the force. For exam- 
ple, flow around an airplane wing at small angles of attack is very smooth, 
so that the vortex street is not well defined and the coefficient Fp is very 
small. The poorly streamlined structure of a suspension bridge is another 
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Figure | 


matter, and it is natural to expect that a force of large amplitude will be set 
up. Thus, a structure suspended in an air stream experiences the effect of 
this force and hence goes into a state of forced vibrations. The amount of 
danger from this type of motion depends on how close the natural 
frequency of the structure (remember that bridges are made of steel, a 
highly elastic material) is to the frequency of the driving force. If the two 
frequencies are the same, resonance occurs, and the oscillations will be de- 
structive if the system does not have a sufficient amount of damping. It has 
now been established that oscillations of this type were responsible for the 
collapse of the Tacoma Bridge. In addition, resonances produced by the 
separation of vortices have been observed in steel factory chimneys, and in 
the periscopes of submarines. 

The phenomenon of resonance was also responsible for the collapse of 
the Broughton suspension bridge near Manchester, England in 1831. This 
occurred when a column of soldiers marched in cadence over the bridge, 
thereby setting up a periodic force of rather large amplitude. The 
frequency of this force was equal to the natural frequency of the bridge. 
Thus, very large oscillations were induced, and the bridge collapsed. It is 
for this reason that soldiers are ordered to break cadence when crossing a 
bridge. 


Epilog. The father of one of my students is an engineer who worked on the 
construction of the Bronx Whitestone Bridge in New York City. He 
informed me that the original plans for this bridge were very similar to those 
of the Tacoma Bridge. These plans were hastily redrawn following the 
collapse of the Tacoma Bridge. 


2.6.2 Electrical networks 


We now briefly study a simple series circuit, as shown in Figure 1 below. 
The symbol £ represents a source of electromotive force. This may be a 
battery or a generator which produces a potential difference (or voltage), 
that causes a current J to flow through the circuit when the switch S is 
closed. The symbol R represents a resistance to the flow of current such as 
that produced by a lightbulb or toaster. When current flows through a coil 
of wire L, a magnetic field is produced which opposes any change in the 
current through the coil. The change in voltage produced by the coil is pro- 
portional to the rate of change of the current, and the constant of propor- 
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Figure 1. A simple series circuit 


tionality is called the inductance L of the coil. A capacitor, or condenser, 
indicated by C, usually consists of two metal plates separated by a material 
through which very little current can flow. A capacitor has the effect of re- 
versing the flow of current as one plate or the other becomes charged. 

Let Q(t) be the charge on the capacitor at time ¢. To derive a differen- 
tial equation which is satisfied by Q(t) we use the following. 


Kirchoff’s second law: In a closed circuit, the impressed voltage equals the 
sum of the voltage drops in the rest of the circuit. 
Now, 


(i) The voltage drop across a resistance of R ohms equals RJ (Ohm’s 
law). 
(11) The voltage drop across an inductance of L henrys equals L(dI/ dt). 
(111) The voltage drop across a capacitance of C farads equals O/C. 


Hence, 
E Q 
(t)= Lg + RI + rok 
and since J(t)=dQ(t)/dt, we see that 
d*Q dQ Q 
tT tka te = E(t). (1) 


Notice the resemblance of Equation (1) to the equation of a vibrating 
mass. Among the similarities with mechanical vibrations, electrical circuits 
also have the property of resonance. Unlike mechanical systems, though, 
resonance is put to good use in electrical systems. For example, the tuning 
knob of a radio is used to vary the capacitance in the tuning circuit. In this 
manner, the resonant frequency (see Exercise 13, Section 2.6) is changed 
until it agrees with the frequency of one of the incoming radio signals. The 
amplitude of the current produced by this signal will be much greater than 


176 


2.6 Mechanical vibrations 


that of all other signals. In this way, the tuning circuit picks out the desired 
station. 


EXERCISES 


1. 


Suppose that a simple series circuit has no resistance and no impressed voltage. 
Show that the charge Q on the capacitor is periodic in time, with frequency 
Wp = V1/LC. The quantity V1/LC_ is called the natural frequency of the 
circuit. 


. Suppose that a simple series circuit consisting of an inductor, a resistor and a 


capacitor is open, and that there is an initial charge Qo= 10~® coulombs on the 
capacitor. Find the charge on the capacitor and the current flowing in the circuit 
after the switch is closed for each of the following cases. 

(a) L=0.5 henrys, C=10~° farads, R= 1000 ohms 

(b) L=1 henry, C=10~4 farads, R=200 ohms 

(c) L=2 henrys, C=10~° farads, R=2000 ohms 


. A simple series circuit has an inductor of 1 henry, a capacitor of 10~° farads, 


and a resistor of 1000 ohms. The initial charge on the capacitor is zero. If a 12 
volt battery is connected to the circuit, and the circuit is closed at t=0, find the 
charge on the capacitor | second later, and the steady state charge. 


. A capacitor of 107? farads is in series with an electromotive force of 12 volts 


and an inductor of 1 henry. At t=0, both Q and / are zero. 

(a) Find the natural frequency and period of the electrical oscillations. 

(b) Find the maximum charge on the capacitor, and the maximum current flow- 
ing in the circuit. 


. Show that if there is no resistance in a circuit, and the impressed voltage is of 


the form Epsinwt, then the charge on the capacitor will become unbounded as 
t>0 if w= V1/LC . This is the phenomenon of resonance. 


. Consider the differential equation 


LO+RO+ £ = Egcos wt. (i) 


We find a particular solution (7) of (i) as the real part of a particular solution 


o(t) of 


LO+RO+ £ = Eye’, (ii) 
(a) Show that E 
iwp(t) = ——_——_~e'*" 
R+ i(wL - ral 


(b) The quantity Z=R+i(wL —1/wC) is known as the complex impedance of 
the circuit. The reciprocal of Z is called the admittance, and the real and im- 
aginary parts of 1/Z are called the conductance and susceptance. De- 
termine the admittance, conductance and susceptance. 


. Consider a simple series circuit with given values of L, R and C, and an im- 


pressed voltage E,)sinwt. For which value of w will the steady state current be a 
maximum? 
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2.7 A model for the detection of diabetes 


Diabetes mellitus is a disease of metabolism which is characterized by too 
much sugar in the blood and urine. In diabetes, the body is unable to burn 
off all its sugars, starches, and carbohydrates because of an insufficient 
supply of insulin. Diabetes is usually diagnosed by means of a glucose 
tolerance test (GTT). In this test the patient comes to the hospital after an 
overnight fast and is given a large dose of glucose (sugar in the form in 
which it usually appears in the bloodstream). During the next three to five 
hours several measurements are made of the concentration of glucose in 
the patient’s blood, and these measurements are used in the diagnosis of 
diabetes. A very serious difficulty associated with this method of diagnosis 
is that there is no universally accepted criterion for interpreting the results 
of a glucose tolerance test. Three physicians interpreting the results of a 
GTT may come up with three different diagnoses. In one case recently, in 
Rhode Island, one physician, after reviewing the results of a GIT, came 
up with a diagnosis of diabetes. A second physician declared the patient to 
be normal. To settle the question, the results of the GIT were sent to a 
specialist in Boston. After examining these results, the specialist concluded 
that the patient was suffering from a pituitary tumor. 

In the mid 1960’s Drs. Rosevear and Molnar of the Mayo Clinic and 
Ackerman and Gatewood of the University of Minnesota discovered a 
fairly reliable criterion for interpreting the results of a glucose tolerance 
test. Their discovery arose from a very simple model they developed for 
the blood glucose regulatory system. Their model is based on the following 
simple and fairly well known facts of elementary biology. 


1. Glucose plays an important role in the metabolism of any vertebrate 
since it is a source of energy for all tissues and organs. For each individual 
there is an optimal blood glucose concentration, and any excessive devia- 
tion from this optimal concentration leads to severe pathological condi- 
tions and potentially death. 

2. While blood glucose levels tend to be autoregulatory, they are also in- 
fluenced and controlled by a wide variety of hormones and other metabo- 
lites. Among these are the following. 

(i) Insulin, a hormone secreted by the B cells of the pancreas. After we 
eat any carbohydrates, our G.I. tract sends a signal to the pancreas to 
secrete more insulin. In addition, the glucose in our blood directly stimu- 
lates the B cells of the pancreas to secrete insulin. It 1s generally believed 
that insulin facilitates tissue uptake of glucose by attaching itself to the im- 
permeable membrane walls, thus allowing glucose to pass through the 
membranes to the center of the cells, where most of the biological and 
chemical activity takes place. Without sufficient insulin, the body cannot 
avail itself of all the energy it needs. 
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(11) Glucagon, a hormone secreted by the a cells of the pancreas. Any ex- 
cess glucose is stored in the liver in the form of glycogen. In times of need 
this glycogen is converted back into glucose. The hormone glucagon in- 
creases the rate of breakdown of glycogen into glucose. Evidence collected 
thus far clearly indicates that hypoglycemia (low blood sugar) and fasting 
promote the secretion of glucagon while increased blood glucose levels 
suppress its secretion. 

(iii) Epinephrine (adrenalin), a hormone secreted by the adrenal medulla. 
Epinephrine is part of an emergency mechanism to quickly increase the 
concentration of glucose in the blood in times of extreme hypoglycemia. 
Like glucagon, epinephrine increases the rate of breakdown of glycogen 
into glucose. In addition, it directly inhibits glucose uptake by muscle 
tissue; it acts directly on the pancreas to inhibit insulin secretion; and it 
aids in the conversion of lactate to glucose in the liver. 

(iv) Glucocorticoids, hormones such as cortisol which are secreted by the 
adrenal cortex. Glucocorticoids play an important role in the metabolism 
of carbohydrates. 

(v) Thyroxin, a hormone secreted by the thyroid gland. This hormone 
aids the liver in forming glucose from non-carbohydrate sources such as 
glycerol, lactate and amino acids. 

(vi) Growth hormone (somatotropin), a hormone secreted by the anterior 
pituitary gland. Not only does growth hormone affect glucose levels in a 
direct manner, but it also tends to “block” insulin. It is believed that 
growth hormone decreases the sensitivity of muscle and adipose membrane 
to insulin, thereby reducing the effectiveness of insulin in promoting 
glucose uptake. 


The aim of Ackerman et al was to construct a model which would ac- 
curately describe the blood glucose regulatory system during a glucose 
tolerance test, and in which one or two parameters would yield criteria for 
distinguishing normal individuals from mild diabetics and pre-diabetics. 
Their model is a very simplified one, requiring only a limited number of 
blood samples during a GTT. It centers attention on two concentrations, 
that of glucose in the blood, labelled G, and that of the net hormonal con- 
centration, labelled H. The latter is interpreted to represent the cumulative 
effect of all the pertinent hormones. Those hormones such as insulin which 
decrease blood glucose concentrations are considered to increase H, while 
those hormones such as cortisol which increase blood glucose concentra- 
tions are considered to decrease H. Now there are two reasons why such a 
simplified model can still provide an accurate description of the blood 
glucose regulatory system. First, studies have shown that under normal, or 
close to normal conditions, the interaction of one hormone, namely in- 
sulin, with blood glucose so predominates that a simple “lumped parame- 
ter model” is quite adequate. Second, evidence indicates that normogly- 
cemia does not depend, necessarily, on the normalcy of each kinetic 
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mechanism of the blood glucose regulatory system. Rather, it depends on 
the overall performance of the blood glucose regulatory system, and this 
system is dominated by insulin-glucose interactions. 

The basic model is described analytically by the equations 


& = F,(G,H)+J (1) (1) 
dH 
af =F, (G,H). (2) 


The dependence of F, and F, on G and H signify that changes in G and H 
are determined by the values of both G and H. The function J (f) is the ex- 
ternal rate at which the blood glucose concentration is being increased. 
Now, we assume that G and H have achieved optimal values G, and H, by 
the time the fasting patient has arrived at the hospital. This implies that 
F\(Go, Ho) =90 and F,(Go, Ho) =0. Since we are interested here in the devia- 
tions of G and H from their optimal values, we make the substitution 
g=G-—Gy, h= H—- Ho. 

Then, 

dg 

a F, (Got g,H oth)+J(t), 
dh 


Now, observe that 
OF) (GpHy) OF (Go, Ho) 

F, (Go+ 8, Hot h)= F, (Go, Ho) + 4G St aR a 
and 
OF, (GoHy) OF, (Go, Mo) 
—4G 8 + aH h+e, 
where e, and e, are very small compared to g and h. Hence, assuming that 
G and H deviate only slightly from G, and A>, and therefore neglecting 
the terms e, and e,, we see that 


dg _ OF; (Go, Ho) OF, (Go, Hg) 


F,(Go+ 8, H+ h) = Fy (Go, Hy) + 


ud...8aG gt — a h+J(t) (3) 
dh _ OF (Go, A ) OF, (Go, Ho) 
dG. (4) 


Now, there are no means, a priori, of determining the numbers 
OF, (Go,Hy) 9F)(Go,Ho) IF (Go, Ho) and OF, (Go, Ho ) 
0G 0H 0G 0H 
However, we can determine their signs. Referring to Figure 1, we see that 
dg / dt is negative for g>0 and h=0, since the blood glucose concentration 
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Tissue 
Uptake 


Hormone 


Endocrines Metabolism 


Figure 1. Simplified model of the blood glucose regulatory system 


will be decreasing through tissue uptake of glucose and the storing 
of excess glucose in the liver in the form of glycogen. Consequently 
OF (Go, H))/9G must be negative. Similarly, 0F\(Go,H))/0H is negative 
since a positive value of h tends to decrease blood glucose levels by facili- 
tating tissue uptake of glucose and by increasing the rate at which glucose 
is converted to glycogen. The number 0F,(Gp, H))/9G must be positive 
since a positive value of g causes the endocrine glands to secrete those 
hormones which tend to increase H. Finally, 0F,(Go,H,))/0H must be 
negative, since the concentration of hormones in the blood decreases 
through hormone metabolism. 
Thus, we can write Equations (3) and (4) in the form 


dg 
Wo M8 mht I(t) (5) 
dh 
Wo ght mas (6) 


where m,, m,, m3, and m, are positive constants. Equations (5) and (6) are 
two first-order equations for g and h. However, since we only measure the 
concentration of glucose in the blood, we would like to remove the vari- 
able h. This can be accomplished as follows: Differentiating (5) with re- 
spect to ¢ gives 


de _ wh , dd 
dt2 ‘dt  *adt~ dt 
Substituting for dh/dt from (6) we obtain that 
dg dg dJ 
Ge ag mash — mma + (7) 


Next, observe from (5) that m,h=(— dg/dt)—m,g+J(t). Consequently, 
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g(t) satisfies the second-order linear differential equation 


dg dg dJ 
72 +(m,+ m3)—, +(m,m,+ m,m,) g=m,J + ih 
We rewrite this equation in the form 
dg, &, , 
—? = 8 
7 +2a Th +wog= S(t) (8) 


where a=(m,+m,)/2, w= m,m,+ mm, and S(1)=m,J+dJ/ dt. 

Notice that the right-hand side of (8) is identically zero except for the 
very short time interval in which the glucose load is being ingested. We will 
learn to deal with such functions in Section 2.12. For our purposes here, let 
t=Q be the time at which the glucose load has been completely ingested. 
Then, for t20, g(Z) satisfies the second-order linear homogeneous equa- 
tion 

d d 

— 7 120g, +weg=0. (9) 
This equation has positive coefficients. Hence, by the analysis in Section 
2.6, (see also Exercise 8, Section 2.2.2) g(t) approaches zero as ¢ ap- 
proaches infinity. Thus our model certainly conforms to reality in pre- 
dicting that the blood glucose concentration tends to return eventually to 
its optimal concentration. 

The solutions g(t) of (9) are of three different types, depending as to 
whether a?—w? is positive, negative, or zero. These three types, of course, 
correspond to the overdamped, critically damped and underdamped cases 
discussed in Section 2.6. We will assume that a?—w? is negative; the other 
two cases are treated in a similar manner. If a? — wo <0, then the character- 
istic equation of Equation (9) has complex roots. It is easily verified in this 
case (see Exercise 1) that every solution g(Z) of (9) is of the form 


g(t)=Ae™ “cos(wt — 4), w* = Ww — a. (10) 
Consequently, 
G(t)=G)+ Ae“ cos(wt — 8). (11) 


Now there are five unknowns Go, A, a, w,, and 6 in (11). One way of de- 
termining them is as follows. The patient’s blood glucose concentration be- 
fore the glucose load is ingested is Gp. Hence, we can determine G, by 
measuring the patient’s blood glucose concentration immediately upon his 
arrival at the hospital. Next, if we take four additional measurements G,, 
G,, G3, and G, of the patient’s blood glucose concentration at times 1, ¢, 
tz, and ¢t,, then we can determine A, a, wo, and 6 from the four equations 


G,= Gy+ Ae” “%cos(wt, — 6), j=1,2,3,4. 


A second, and better method of determining Gp, A, a, wo, and 6 is to take n 
measurements G,,G,,...,G,, of the patient’s blood glucose concentration at 
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times ¢,,¢,,...,4,. Typically n is 6 or 7. We then find optimal values for Go, 
A, , Wo, and 6 such that the least square error 
n 
E= > [G,-G,— Ae~*cos(wt,— 8) | 
j=l 

is minimized. The problem of minimizing E can be solved on a digital 
computer, and Ackerman et al (see reference at end of section) provide a 
complete Fortran program for determining optimal values for Go, A, a, Wo, 
and 6. This method is preferrable to the first method since Equation (11) is 
only an approximate formula for G(t). Consequently, it is possible to find 
values Go, A, &, Wo, and 6 so that Equation (11) is satisfied exactly at four 
points ¢,, f,, ¢;, and ¢, but yields a poor fit to the data at other times. The 
second method usually offers a better fit to the data on the entire time in- 
terval since it involves more measurements. 

In numerous experiments, Ackerman et al observed that a slight error in 
measuring G could produce a very large error in the value of a. Hence, any 
criterion for diagnosing diabetes that involves the parameter a is unreli- 
able. However, the parameter w,, the natural frequency of the system, was 
relatively insensitive to experimental errors in measuring G. Thus, we may 
regard a value of wy, as the basic descriptor of the response to a glucose 
tolerance test. For discussion purposes, it is more convenient to use the 
corresponding natural period 7,=27/w ). The remarkable fact is that data 
from a variety of sources indicated that a value of less than four hours for 
T, indicated normalcy, while appreciably more than four hours implied mild 
diabetes. 


Remark 1. The usual period between meals in our culture is about 4 hours. 
This suggests the interesting possibility that sociological factors may also 
play a role in the blood glucose regulatory system. 


Remark 2. We wish to emphasize that the model described above can only 
be used to diagnose mild diabetes or pre-diabetes, since we have assumed 
throughout that the deviation g of G from its optimal value Gy is small. 
Very large deviations of G from Gp usually indicate severe diabetes or di- 
abetes insipidus, which is a disorder of the posterior lobe of the pituitary 
gland. 


A serious shortcoming of this simplified model is that it sometimes 
yields a poor fit to the data in the time period three to five hours after in- 
gestion of the glucose load. This indicates, of course, that variables such as 
epinephrine and glucagon play an important role in this time period. Thus 
these variables should be included as separate variables in our model, 
rather than being lumped together with insulin. In fact, evidence indicates 
that levels of epinephrine may rise dramatically during the recovery phase 
of the GTT response, when glucose levels have been lowered below fasting 
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Figure 2. Graph of g(?) if a2—w%>0 


levels. This can also be seen directly from Equation (9). If a?—w?>0, then 
g(t) may have the form described in Figure 2. Note that g(t) drops very 
rapidly from a fairly high value to a negative one. It is quite conceivable, 
therefore, that the body will interpret this as an extreme emergency and 
thereby secrete a large amount of epinephrine. 

Medical researchers have long recognized the need of including epi- 
nephrine as a separate variable in any model of the blood glucose regula- 
tory system. However, they were stymied by the fact that there was no relli- 
able method of measuring the concentration of epinephrine in the blood. 
Thus, they had to assume, for all practical purposes, the the level of epi- 
nephrine remained constant during the course of a glucose tolerance test. 
This author has just been informed that researchers at Rhode Island 
Hospital have devised an accurate method of measuring the concentration 
of epinephrine in the blood. Thus we will be able to develop and test more 
accurate models of the blood glucose regulatory system. Hopefully, this 
will lead to more reliable criteria for the diagnosis of diabetes. 


Reference 


E. Ackerman, L. Gatewood, J. Rosevear, and G. Molnar, Blood glucose regulation 
and diabetes, Chapter 4 in Concepts and Models of Biomathematics, F. Heinmets, 
ed., Marcel Dekker, 1969, 131-156. 


EXERCISES 
1. Derive Equation (10). 


2. A patient arrives at the hospital after an overnight fast with a blood glucose con- 
centration of 70 mg glucose/100 ml blood (mg glucose/100 ml blood = 
milligrams of glucose per 100 milliliters of blood). His blood glucose concentra- 
tion | hour, 2 hours, and 3 hours after fully absorbing a large amount of glucose 
is 95, 65, and 75 mg glucose/100 ml blood, respectively. Show that this patient is 
normal. Hint: In the underdamped case, the time interval between two succes- 
Sive zeros of G— Gp exceeds one half the natural period. 
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According to a famous diabetologist, the blood glucose concentration of a 
nondiabetic who has just absorbed a large amount of glucose will be at or 
below the fasting level in 2 hours or less. Exercises 3 and 4 compare the 
diagnoses of this diabetologist with those of Ackerman et al. 


3. The deviation g(t) of a patient’s blood glucose concentration from its optimal 
concentration satisfies the differential equation (d’g / dt?) +2a(dg/dt)+ a’g=0 
immediately after he fully absorbs a large amount of glucose. The time ¢ is 
measured in minutes, so that the units of a are reciprocal minutes. Show that 
this patient is normal according to Ackerman et al, if a > 7/120 (min), and that 
this patient is normal according to the famous diabetologist if 


g'(0) < —(q5 + #) (0). 


4. A patient’s blood glucose concentration G(f) satisfies the initial-value problem 
2 
MG, 1 a, 1 
dt? 20(min) 4 — 2500 (min) 
= | ___45 mg glucose /100 ml blood; 
2500 (min) 
G (0) =150 mg glucose /100 ml blood, 
1 1-—4e!8/5 
200 1 — ¢!8/5 
immediately after he fully absorbs a large amount of glucose. This patient’s opti- 
mal blood glucose concentration is 75 mg glucose/100 ml blood. Show that this 


patient is a diabetic according to Ackerman et al, but 1s normal according to the 
famous diabetologist. 


G'(0)=—aG(0)/(min); a> 


2.8 Series solutions 


We return now to the general homogeneous linear second-order equation 


LL y]= PO +00 + R(Dy=0 (1) 


with P (7) unequal to zero in the interval a<¢< B. It was shown in Section 
2.1 that every solution y(¢) of (1) can be written in the form y(t)=c, y,(0) 
+c, y(t), where y,(t) and y,(t) are any two linearly independent solutions 
of (1). Thus, the problem of finding all solutions of (1) is reduced to the 
simpler problem of finding just two solutions. In Section 2.2 we handled 
the special case where P, Q, and R are constants. The next simplest case is 
when P(t), Q(t), and R(t) are polynomials in ¢. In this case, the form of 
the differential equation suggests that we guess a polynomial solution y(t) 
of (1). If y(t) is a polynomial in ¢, then the three functions P(t)y”(t), 
QO(t)y(t), and R(t)y(t) are again polynomials in ¢. Thus, in principle, we 
can determine a polynomial solution y(Z) of (1) by setting the sums of the 
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coefficients of like powers of ¢ in the expression L[ y](¢) equal to zero. We 
illustrate this method with the following example. 


Example 1. Find two linearly independent solutions of the equation 


d*y | & 
L = — —21— —2y=0. 2 
[»] Pp 217 2y =0 (2) 
Solution. We will try to find 2 polynomial solutions of (2). Now, it is not 
obvious, a priori, what the degree of any polynomial solution of (2) should 
be. Nor is it evident that we will be able to get away with a polynomial of 
finite degree. Therefore, we set 


co 
y(t)=agtattar’t...= > at". 
n=0 
Computing 
d oO 
= =a,+2a,t+3a,t?+...= > na,t"™' 
n=0 
and 
d*y ~ 
7 =2a,+6a3t+ = >> n(n—- l)a,t"~?, 
dt n=0 


we see that y(f) is a solution of (2) if 


LI y|(j= Dd n(n—1)a,t"~?-2t >) na,t"~'-2 >) a,t” 
n=0 n=Q 


n=0 
oO oO oO 
= ¥) n(n—l)a,t"~*-2 > na,t"-2 > a,t"=0. (3) 
n=0 n=0 n=0 


Our next step is to rewrite the first summation in (3) so that the expo- 
nent of the general term is n, instead of n—2. This is accomplished by in- 
creasing every n underneath the summation sign by 2, and decreasing the 
lower limit by 2, that is, 


SY n(n-l)at"-*= DS (n+2)(n+1)a,,5¢". 
n=0 


n=—2 


(If you don’t believe this, you can verify it by writing out the first few 
terms in both summations. If you still don’t believe this and want a formal 
proof, set m=n—2. When x7 is zero, m is —2 and when n is infinity, m is 
infinity. Therefore 


fo 2) io 2) 
YS n(n-l)a,t”-*= DS (m+2)(m+1)a,,5t", 
n=U m=-—2 


and since m is a dummy variable, we may replace it by n.) Moreover, ob- 
serve that the contribution to this sum from n= —2 and n=-—1 is zero 
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since the factor (n+2)(n+ 1) vanishes in both these instances. Hence, 


>» n(n—1)a,t"~2= > (n+2)(n+1)a,,>t" 


n= 


and we can rewrite (3) in the form 


> (n+2)(n+1)a,,,t"—2 Ss na,t”—2 Ss a,t” =0. (4) 
n=0 n=0 
Setting the sum of the coefficients of like powers of t in (4) equal to zero 
gives 
(n+2)(n+ l)a,4.—2na, —2a,=0 
so that 
2(n + 1)a, 2a 


“+2 n+ D)(nt1) +2 (0°) 


Equation (5) is a recurrence formula for the coefficients dp, a,,@,,a3,.... 
The coefficient a, determines the coefficient a,,,. Thus, a) determines a, 
through the relation a,=2a)/2= ap; a), in turn, determines a, through the 
relation a,=2a,/(2+2)=a)/2; and so on. Similarly, a, determines a, 
through the relation a, =2a,/(2+1)=2a,/3; a3, in turn, determines a, 
through the relation a,=2a,/(3+2)=4a,/3-5; and so on. Consequently, 
all the coefficients are determined uniquely once da, and a, are prescribed. 
The values of a, and a, are completely arbitrary. This is to be expected, 
though, for if 
y(t)=aytatt+a,t?+... 


then the values of y and y’ at t=O are a, and a, respectively. Thus, the 
coefficients a) and a, must be arbitrary until specific initial conditions are 
imposed on y. 

To find two solutions of (2), we choose two different sets of values of a 
and a,. The simplest possible choices are (i) dg=1,a,=0; (ii) ag=0, a, =1. 


(1) ay=1, a,=0. 


In this case, all the odd coefficients a,,a3,a5,... are zero since a,=2a,/3= 
0, a; =2a,/5=0, and so on. The even coefficients are determined from the 
relations 


a, 1 2a, 
ih ir Sir i ar Tr Ik 
and so on. Proceeding inductively, we find that 
1 l 


a, == = —.. 
an 2-3--+n~ ni 
Hence, 


y(jalte+ eh 7 +e! 
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is one solution of (2). 

(11) ay = 0, a,=1. 
In this case, all the even coefficients are zero, and the odd coefficients are 
determined from the relations 


_24 2 _ 2a, 22 2a5_ 222 
WB, As= FE, GDH TR, 
3 3 5 5 3 7 75 3 
and so on. Proceeding inductively, we find that 
Qn 


Man+ 1 305... (n+l) 
Thus, 


2p 945 Ss ynj2n+l 
th=t+2-+2—+...= ) —*_ 
y2(4) 3° 3-5 3-5-+-(2n+1) 
is a second solution of (2). 


Now, observe that y,(t) and y,(t) are polynomials of infinite degree, 
even though the coefficients P(t)=1, Q(t) = —2t, and R(t)= —2 are poly- 
nomials of finite degree. Such polynomials are called power series. Before 
proceeding further, we will briefly review some of the important properties 
of power series. 


1. An infinite series 
lo @) 


2 n 
y(t) =ag+ a, (t— ty) +ag(t— ty +... = DB a,(t— fo) (6) 
nji= 
is called a power series about ¢ = fp. 

2. All power series have an interval of convergence. This means that there 
exists a nonnegative number p such that the infinite series (6) converges 
for |t —t)|<p, and diverges for |t — t)| >. The number p is called the 
radius of convergence of the power series. 

3. The power series (6) can be differentiated and integrated term by term, 
and the resultant series have the same interval of convergence. 

4. The simplest method (if it works) for determining the interval of conver- 
gence of the power series (6) is the Cauchy ratio test. Suppose that the 
absolute value of a,,,/a, approaches a limit A as n approaches infinity. 
Then, the power series (6) converges for |t—¢.|<1/A, and diverges for 

5. The product of two power series >) .a,(t— to)” and >) 0b, (t— to)” 
is again a power series of the form }) “oc, (t— to)", with c, =agb, + 
a,b,_,+...+4a,b 9. The quotient 

Aytattayt?+... 
byt b t+ bot? +... 
of two power series is again a power series, provided that b) #0. 
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6. Many of the functions f(t) that arise in applications can be expanded in 

power series; that is, we can find coefficients ay, a,,a),... so that 
oe) 
f(t) =ajt+a,(t—t)) +a,(t—t)) +...= Dd a,(t- to)”. (7) 
n=Q 

Such functions are said to be analytic at t= tp, and the series (7) is called 
the Taylor series of f about ¢= 7. It can easily be shown that if f admits 
such an expansion, then, of necessity, a, =f(t,)/n!, where f(t) = 
d"f(t)/dt". 

7. The interval of convergence of the Taylor series of a function f(t), about 
ty, can be determined directly through the Cauchy ratio test and other 
similar methods, or indirectly, through the following theorem of complex 
analysis. 


Theorem 6. Let the variable t assume complex values, and let z, be the point 
closest to t, at which f or one of its derivatives fails to exist. Compute the 
distance p, in the complex plane, between t, and Z,. Then, the Taylor series 
of f about t, converges for |t — ty|<p, and diverges for |t — to| >p. 


As an illustration of Theorem 6, consider the function f(t)=1/(1+t?). 
The Taylor series of f about t = 0 is 


1 414-1 +... 
1+? 
and this series has radius of convergence one. Although the function 
(1+ 147)~! is perfectly well behaved for ¢ real, it goes to infinity when ¢ = + i, 
and the distance of each of these points from the origin is one. 

A second application of Theorem 6 is that the radius of convergence of 
the Taylor series about t=0 of the quotient of two polynomials a(t) and 
b(t), is the magnitude of the smallest zero of b(t). 

At this point we make the important observation that it really wasn’t 
necessary to assume that the functions P(t), Q(t), and R(Z) in (1) are poly- 
nomials. The method used to solve Example | should also be applicable to 
the more general differential equation 


Liy]=POS +902 + R(t)y=0 


where P(t), Q(t), and R(t) are power series about t,. (Of course, we would 
expect the algebra to be much more cumbersome in this case.) If 


P(t)=potp(t—t)+..., O(t)= dot q(t— to) +..., 
R(t)=rotr,(t— to) +... 


and y(t) =a) +a,(t—ty)+..., then L[ y](t) will be the sum of three power 
series about f =f). Consequently, we should be able to find a recurrence 
formula for the coefficients a, by setting the sum of the coefficients of like 
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powers of ¢ in the expression L[y|(t) equal to zero. This is the content of 
the following theorem, which we quote without proof. 


Theorem 7. Let the functions Q(t)/P(t) and R(t)/ P(t) have convergent 
Taylor series expansions about t= to, for |t—to|<p. Then, every solution 
y(t) of the differential “en 


P= - + o() 2 7 +R(t)y=0 (8) 
is analytic at t=), and the radius of convergence of its Taylor series ex- 
pansion about t= ty is at least p. The coefficients a,,a,,..., in the Taylor 
series expansion 

2 
y(t)=ayt+a,(t— to) +a,(t—-to) +... (9) 


are determined by plugging the series (9) into the differential equation (8) 
and setting the sum of the coefficients of like powers of t in this expression 
equal to zero. 


Remark. The interval of convergence of the Taylor series expansion of any 
solution y(t) of (8) is determined, usually, by the interval of convergence of 
the power series Q(t)/ P(t) and R(t)/ P(t), rather than by the interval of 
convergence of the power series P(t), Q(t), and R(t). This is because the 
differential equation er must be e in the standard form 


“ +p()2 7 + a(t)y=0 


whenever we examine questions of existence and uniqueness. 


Example 2. 
(a) Find two linearly independent solutions of 


d*y 3, ay 1 
=—— + +4 
[] dt? jl+e2dt 14/7 


(b) Find the solution y(t) of (10) which satisfies the initial conditions y(0) 
=2, y’'(0)=3. 

Solution. 

(a) The wrong way to do this problem is to expand the functions 3¢/(1+ ¢’) 
and 1/(1+ ¢?) in power series about t=0. The right way to do this problem 
is to multiply both sides of (10) by 1+ 7? to obtain the equivalent equation 


Ll y]= =(14 2) ~ 431% +yn0. 


y=0. (10) 


We do the problem this way because the algebra is much less cumbersome 
when the coefficients of the differential equation (8) are polynomials than 
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when they are power series. Setting y()= > ~_,a,t”, we compute 


n=Q°n 


Ll y](Q=(1+2) 3 n(n-1)a,t"-2+3¢ > nat” '+ D at” 
n=0 


n=0 n=0 


n(n—l)a,t"-?+ > [n(n- 1)+3n+1 ]a,t” 
n=0 


ll 
Ms 


x 
ll 
7] 


Ms 


(n+2)(n+1)a,,5t"+ > (n+1)a,t". 
=0 


3 
I 
o 


Setting the sum of the coefficients of like powers of ¢ equal to zero gives 
(n+2)(n+ l)a,,,+(n+1)’a, =0. Hence, 


= — = - 1] 
Ont 2 (nn +-2)(n +1) n+2 (11) 
Equation (11) is a recurrence formula for the coefficients a,,a3,... in terms 
of a, and a,. To find two linearly independent solutions of (10), we choose 
the two simplest cases (1) dg=1, a, =0; and (i1) ag=0, a, =1. 
(1) a)= 1, a,=0. 


In this case, all the odd coefficients are zero since a,= —2a,/3=0, a,= 
— 4a,/5=0, and so on. The even coefficients are determined from the rela- 
tions 


and so on. Proceeding inductively, we find that 


-(2n—1) 1 1-3+++(2n-1) 
=(-1) ee a) 
Thus, 
—, 2, 1344 _ < nit3++-(2n—1) 
yi(Q=l-F +s t= CY +" (12) 


is one solution of (10). The ratio of the (n+1)st term to the nth term of 
y(t) is 
3-+-(2n—1)(2n+1)0*"*? aM I —(2n+1)0? 


2"+1(n +1)! 1-3-+-(2n—1)0" —-2A(n4+1) ’ 


and the absolute value of this quantity approaches ?? as n approaches in- 
finity. Hence, by the Cauchy ratio test, the infinite series (12) converges for 
|¢|< 1, and diverges for |¢|>1. 


(11) ay) = 0, a,=1. 
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In this case, all the even coefficients are zero, and the odd coefficients are 
determined from the relations 


__ 7% 2 __ 443 _ 2-4 _ 245 _ _ 2-4-6 
43 3 5 a 7 3-5-7” 
and so on. Proceeding inductively, we find that 


a= (1) ee 
2n+I 3-5---(Qnt]) 3-5++-(2n4+1)" 


Thus, 


, 2 3 2°4 5 . (—1)"2"n! 2n+1 
th=t—-=r+—rt+...= —__________—_- 13 
yi(at— set ae 2555 Qn) (13) 


is a second solution of (10), and it is easily verified that this solution, too, 
converges for |t]|<1, and diverges for |t|>1. This, of course, is not very 
surprising, since the Taylor series expansions about t=0 of the functions 
3¢/(1+ 27) and 1/(1+2) only converge for |t| <1. 

(b) The solution y,(t) satisfies the initial conditions y (0) = 1, y’(0)=0, while 
y(t) satisfies the initial conditions y(0)=0, y’(0)=1. Hence y(t) =2y,(t) + 
3y(t). 


Example 3. Solve the initial-value problem 


d’y _2W 
L[y]=—3 +t a 127 =9: y(0)=1, y’(0) =0. 
Solution. Setting y()= > mont” we compute 
°°) io @) 00 
LI y|(th= > n(n—-Ma,t"-2+0? >) nat" '+2t > a,t" 
n=0 n=Q n=0 


io) Loo) ioe) 
= > n(n-1)a,t"-27+ > nat"*!+2 3S ae"t! 


n=() n=z(Q n=(Q 
ore) ore) 

= > n(n—-1)a,t"-*+ DS (n+2)a,t"*". 
n=(Q n=0 


Our next step is to rewrite the first summation so that the exponent of the 
general term is n+1 instead of n—2. This is accomplished by increasing 
every n underneath the summation sign by 3, and decreasing the lower 
limit by 3; that is, 


co co 
> n(n—1)a,t”-2= S (n+3)(n+2)a,,3t"*! 
n= —3 


n=) 


= D> (nt+3)(nt+2)a,,3071!. 
n=—] 
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Therefore, 


Ll y|()= S (n+3)(n+2)a,,3t"*'+ Ss (n+2)a,t"*! 


n=—1 n=0 
1o 2) io) 
=2a,+ > (n+3)(n+2)a,,,t"7t'+ > (nt+2)a,t"*". 
n=0 n=0 
Setting the sums of the coefficients of like powers of t equal to zero gives 
2a,=0, and (n+3)(n+2)a,,,+(n+2)a,=0; n=0,1,2,... 


Consequently, 
a 


n 
a, =0, and An+3~ ~ 43? n20. (14) 
The recurrence formula (14) determines a, in terms of ao, a, in terms of 
a,, as in terms of a,, and so on. Since a, =0, we see that as, ag,a),,... are all 


zero, regardless of the values of a, and a,. To satify the initial conditions, 
we set d)=1 and a, =0. Then, from (14), a4,a5,a;9,... are all zero, while 
— 4% 1 — 4 | dg l 
B= 373 ~~~ G36 BO 3-69 
and so on. Proceeding inductively, we find that 


(-1I) (-1"— (“1 


a3 ee = 


n3.6---3n 3"1-2--+n 3"! 
Hence, 


_, 6, ¢ 19 eh 
y(t)=1 3 +3. Septet Doe 


n=0 


By Theorem 7, this series converges for all t, since the power series ¢* and 
2t obviously converge for all ¢. (We could also verify this directly using the 
Cauchy ratio test.) 


Example 4. Solve the initial-value problem 
Dy by 
= (¢2—9 7) — —])—~— =(): — "“1)\= 
LI y] (t*—2t) 72 +5(t a +3y=0; yQ)=7, y(1)=3. (15) 


Solution. Since the initial conditions are given at t=1, we will express the 
coefficients of the differential equation (15) as polynomials in (t—1), and 
then we will find y(t) as a power series centered about t=1. To this end, 
observe that 


t? —2t=t(t—2)=[(¢-1) +1 ][(t-1)-1 J =(¢- 1) 1. 


Hence, the differential equation (15) can be written in the form 
[y 
Ly]  ( ) 1] 72 +5(t—1) 7 + 3y =0. 
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ie @) 


Setting y()= > o4,(t— 1)”, we compute 
Ll y= [1-1] & nn l)a,(t—1)"~? 
+5(t—1) D na,(e—1)""43 > a(t) 
=~ D n(n Nay(e—"? 


+ > n(n—l)a,(t-1)"+ dS (5n4+3)a,(t—-1)" 
n=0 n=0 


= — Ss (n+2)(n+1)a,,,(t- 1)"+ 
n=0 


(n?+4n+3)a,(t—1)’. 
n=0 
Setting the sums of the coefficients of like powers of ¢ equal to zero gives 
—(n+2)(nt l)a,,,+(n?+4n+3)a, =0, so that 


n?+4n+3 _nt+3 


= ——__————_q, = ——-4,, > 0. 16 
n+2 (n+2)(n+1) ay, n+20" n ( ) 


To satisfy the initial conditions, we set ag=7 and a,=3. Then, from (16), 


3, 3 _5) _ 5:3 _7. _7:5:3 
B= 747 MQ“ Ggh = 64 647! 
424.3 6 = 8:4. _8 _ 8-6-4 
ic a hs a i ST Bc 


and so on. Proceeding inductively, we find that 


3°5---(2n+1) F 4-6---(2n+2) corn>l 
A>, = 7-4 (n) an @nt1= 33nd) (for n> 1). 
Hence, 
y(t)=743(t= 1) + 5-7(t- IP + $30 1) +... 

2. 365+++(2n+1)(t—-1)" 2) 2"(n+1)'(t-1)"*! 
=74+7 > 4 3-1) +3) — 

2 2"n! (71) 2 3-5+--(2n+1) 


Example 5. Solve the initial-value problem 
d’y 
L[y]=-)7 5 + a td y =0; y(0)=1, y'(0)=1. 


Solution. Setting y(t)= > "_,a,t”, we compute 


n=OQ'°n 
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Li y|(=(1 —t) > n(n l)a,t”~? 


n 


00 
+ > na,t"~'+(1-12) 


00 
> a,t 
n=0 n=0 
oe) 0 
= ¥ n(n—-l1)a,t"-7-— S n(n—-1)a,t"™! 
n=0 n=0 


o@) io.) 
na,t"'+ Sat”— > at"?! 
n=0 n=0 


io @) io 6) 
>» (n+2)(n+1)a,,5t"- > n(n—2)a,t"~' 
n=0 n=0 


(oe) oe) 
+ Sat"- > at"! 
n=0Q 


n=0 


BS (n+2)(nt lay gat™— Di (nt 1)(n= Vey gt 
n=0 


n=0 
ove) re) 

+ Sat"- Dd a,_,t” 
n=0 n=] 

=2a,+a,+ da 

ove) 

+ > {(n+2)(n+ l)a,4.—(n+1)(n—1)a,,,+4,-4,_,}t". 
n=] 


Setting the coefficients of each power of ¢ equal to zero gives 


a, +a (n+1)(n—1)a,,,-4,+4,_, 
a, = — 7 and on a d(ntl n>}. 
(17) 
To satisfy the initial conditions, we set aj=1 and a,=1. Then, from (17), 
— a, +a 3a,-a,+a, | 
a,=—l, a,= ————— = 0, 7 a a 
8a,—a,+a, | I5a,;—a,t+a, 1 
‘s20—~«<“‘~ 80860 


and so on. Unfortunately, though, we cannot discern a general pattern for 
the coefficients a, as we did in the previous examples. (This is because the 
coefficient a,,. depends on the values of a,,,, a,, and a,_,, while in our 
previous examples, the coefficient a, ,, depended on only one of its prede- 
cessors.) This is not a serious problem, though, for we can find the 
coefficients a, quite easily with the aid of a digital computer. Sample Pascal 
and Fortran programs to compute the coefficients a,,...,a, in terms of a, 
and a,, and to evaluate the “approximate” solution 


y(t)=aytatt+...+a,t" 
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at any point ¢ are given below. These programs have variable values for dp 
and a,, so they can also be used to solve the more general initial-value 
problem 


d*y dy 
(I-) 5 + Gt dy =0; y(0)=ay, y’(0)=a}. 


Pascal Program 


Program Series (input, output); 


var 
A: array[0.. 199] of real; 
T, sum: real; 
k, N: integer; 


begin 
readin(A[0], A[1], T, N); 
page; 
A[2] := —0.5«(A[1] + A[0]); 
sum :=A[0]+A[1] *T+A[2] + T + T; 
for k:=1 to N—2 do 


begin 
A[k +2] :=((k+1) * (k—1)* A[k+ 1] —A[k] + A[k— 1]) 
M((k+1)*(k+2)); 
sum :=sum-+A[k+ 2] « exp((k + 2) « In(T)); 
end; 


writeln(‘For N=’ ,N:3, ’ and T=’ ,T:6:4); 
writeln(‘the sum is: ’,sum:11:9); 
end. 


Fortran Program 


DIMENSION A(200) 
READ (5, 10) AO, A(1), T, N 
10 FORMAT (3F15.8, 15) 
A(2)= —0.5*(A(1)+A0) 
A(3) =(A0— A(1)) /2. «3. 
SUM =A0+A(1)* T+ A(2)*T# #2 +A(3)#T #3 
NA=N—2 
DO20 K=2,NA 
A(K + 2)=(A(K—1)— A(K) +(K +1.) #(K—1.) * 
1 | AK+1))/(K+1.)*(K+2.) 
SUM =SUM + A(K + 2)*T* *(K+2) 
20 CONTINUE 
WRITE (6,30) N,T,SUM 
30 FORMAT (1H1, ‘FOR N=’, 13, ‘, AND T=’, F10.4/1H, ‘THE 
1 | SUM IS’, F20.9) 
196 CALL EXIT 
END 
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See also C Program 14 in Appendix C for a sample C program. 
Setting A[0]=1, A[1]=1, (A(1)=1 for the Fortran program), T=0.5, and 
N = 20 in these programs gives 


y(4) = ayta,(4) +... + ayo(4)” = 1.26104174. 


This result is correct to eight significant decimal places, since any larger 
value of N yields the same result. 


EXERCISES 
Find the general solution of each of the following equations. 
Ly’ +y’+y=0 2. y’-ty=0 
3. (2+ 22) y” —b’—3y =0 4. y”-fy=0 
Solve each of the following initial-value problems. 
3. ((2—ty” —6(t—I)y’—4y =0; y=], y'()=0 
6. y”+?y=0; y(0)=2, y'(0)=—1 
7. y”—Py=0; y(0)=0, y'(O)= —2 
By" +(02+264+l)y’—(444)y=0; y(—1=0, y(-)l=1 


9. The equation y” —2ty’+Ay =0, A constant, is known as the Hermite differential 
equation, and it appears in many areas of mathematics and physics. 
(a) Find 2 linearly independent solutions of the Hermite equation. 
(b) Show that the Hermite equation has a polynomial solution of degree n if 
A=2n. This polynomial, when properly normalized; that is, when multi- 
plied by a suitable constant, is known as the Hermite polynomial H,,(?). 


10. The equation (1 — t?)y” —2’+ a(a+ 1)y=0, a constant, is known as the 
Legendre differential equation, and it appears in many areas of mathematics 
and physics. 

(a) Find 2 linearly independent solutions of the Legendre equation. 

(b) Show that the Legendre differential equation has a polynomial solution of 
degree n if a=n. 

(c) The Legendre polynomial P,,(7) is defined as the polynomial solution of the 
Legendre equation with a=n which satisfies the condition P,(1)=1. Find 
Po(t), Pi(t), P(t), and P(t). 


11. The equation (1 — ¢”) y” — ty’ + a*y =0, a constant, is known as the Tchebycheff 
differential equation, and it appears in many areas of mathematics and physics. 
(a) Find 2 linearly independent solutions of the Tchebycheff equation. 
(b) Show that the Tchebycheff equation has a polynomial solution of degree n 
if a=n. These polynomials, when properly normalized, are called the 
Tchebycheff polynomials. 
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12. (a) Find 2 linearly independent solutions of 
y"+ Py’ +3ty=0. 


(b) Find the first 5 terms in the Taylor series expansion about t=0 of the solu- 
tion y(t) of the initial-value problem 


y"+Py'+3ry=es  y(0)=0, y’(0)=0. 


In each of Problems 13-17, (a) Find the first 5 terms in the Taylor series 


expansion >.- o%,t” of the solution y(t) of the given initial-value prob- 
lem. (b) Write a computer program to find the first N+1 coefficients 
Ag, 41,---,@y, and to evaluate the polynomial Ag+ att .. .tayt™. Then, 


obtain an approximation of yG ) by evaluating >” 1 =04 a,(3 y. 
13. d—b)y"+5p’+y=0; yO=1, yXO=0 

14. y’+y’+tH=0; yO=-—1, y'O)=2 

15. y"+H’+e'y=0; yO =I, y’(0)=0 

16. y’+y’+e‘y=0; y(O)=0, y'(O=-1 

17. y"+y’+e ‘y=0; y(0)=3, yO=5 


2.8.1 Singular points, Euler equations 


The differential equation 
a 
Lly]=P(t)— 5 » + (0) 7+ R(t)y=0 (1) 


is said to be singular at t=7,) if P(t,)=90. Solutions of (1) frequently 
become very large, or oscillate very rapidly, in a neighborhood of the 
singular point ¢,). Thus, solutions of (1) may not even be continuous, let 
along analytic at ¢,, and the method of power series solution will fail to 
work, in general. 

Our goal is to find a class of singular equations which we can solve for t 
near f,. To this end we will first study a very simple equation, known as 
Euler’s equation, which is singular, but easily solvable. We will then use the 
Euler equation to motivate a more general class of singular equations which 
are also solvable in the vicinity of the singular point. 


Definition. The differential equation 
2 d*y 
Ll y]=0? 7 +t + By= 0. (2) 


where a and £ are constants is known as Euler’s equation. 
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We will assume at first, for simplicity, that f > 0. Observe that r*y” and 


ty’ are both multiples of ¢’ if y=+’. This suggests that we try y=t’ as a 
solution of (2). Computing 


d | d? : 
—r’=rt’' and —t’=r(r—1)t'~? 
i (r—1) 


dt 
we see that 
Lit’) =r(r—-1)t'+ art’ + Br’ 
=[r(r—-l)+ar+ Blt’ 
= F(r)t" (3) 
where 


F(r)=r(r-—1)+ar+8B 
=r?+(a—1)r+B. (4) 


Hence, y=?’ is a solution of (2) if, and only if, 7 is a solution of the 
quadratic equation 


r>+(a—1)r+p=0. (5) 


The solutions r,, 7, of (5) are 


r= —3](a-1)+ (a1)? 48 | 
n= §{(a-1)-V(a-1)-48 |, 


Just as in the case of constant coefficients, we must examine separately the 
cases where (a — 1)? — 48 is positive, negative, and zero. 


Case 1. (a—1)?—48 > 0. In this case Equation (5) has two real, unequal 
roots, and thus (2) has two solutions of the form y,(t)=¢t", y(t) =”. 
Clearly, ¢”' and ¢” are independent if r, #,r,. Thus the general solution of 
(2) 1s (for t > 0) 


y(t)=ett+ct”. 


Example 1. Find the general solution of 


d* 
LLy]= 2 +a 42y=0, t>0. (6) 


Solution. Substituting y = t" in (6) gives 
L[t’])=r(r—1)e’+4rt’? +20" 
=[r(r -1)+4r +2]2’ 
=(r24+3r4+2)t’ 
=(r4+1)(r4+2)r’ 
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Hence r, = —1, 7, = —2 and 


Cc,  C¢ 
is the general solution of (6). 


Case 2. (a—1)? —48 = 0. In this case 
l-—a 

2 
and we have only one solution y = ¢” of (2). A second solution (see Exercise 
11) can be found by the method of reduction of order. However, we would 
like to present here an alternate method of obtaining y, which will generalize 
very nicely in Section 2.8.3. Observe that F(r)=(r—r,)* in the case of 
equal roots. Hence 


L[t’]=(r-7,)'r’. (7) 
Taking partial derivatives of both sides of (7) with respect to r gives 
2 Lf" =1) 21] =2[(r-n)r4. 
Since 0(t”)/dr = t'ln t, we see that 
L[t’Int] =(r—r,) tnt +2(r—1,)t”. (8) 
The right hand side of (8) vanishes when r = r,. Hence, 
L{t"Int]=0 


which implies that y,(¢)=?f"ln¢ is a second solution of (2). Since t” and 
t™'In ¢ are obviously linearly independent, the general solution of (2) in the 
case of equal roots is 


y(t)=(e,+c,Int)t’, t>0. 


Example 2. Find the general solution of 


dz 
L[y] =P 2 —51F + 9y=0, t>0. (9) 


Solution. Substituting y = t’ in (9) gives 
L[t'|=r(r—-1)t!—Srt’ +90" 
=[r(r—1)—5r+9]t’ 
=(r*—6r+9)t" 
=(r—3)'t’. 
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The equation (r — 3)? = 0 has r =3 as a double root. Hence, 
y(t)=0°, y(t) =e Int 
and the general solution of (9) is 


y(t)=(e,+e,Int)r*, t>0. 


Case 3. (a—1)* —48 <0. In this case. 
r;=At+im and r=A—ip 
with 


—t-a _ [48-(a-1)]'” 
i ee) a (10) 


are complex roots. Hence, 
(t) — prtin — pApin 
= th(elnt)’ = prpgivint 
= t*[cos( int) +isin(yln 2) 
is a complex-valued solution of (2). But then (see Section 2.2.1) 
y(t) =Re{o(z)} =r*cos( In r) 
and 
y,(t) = Im{o(t)} =2'sin(pinr) 


are two real-valued independent solutions of (2). Hence, the general solution 
of (2), in the case of complex roots, is 


y(t) =| c,cos(pln t)+ c,sin(pln r)| 
with A and p given by (10). 


Example 3. Find the general solution of the equation 
L[ y] =t?y”—5ty’+25y = 0, t>0. (11) 
Solution. Substituting y = t’ in (11) gives 
Lit’) =r(r—1)t"—5rt”+25t" 
=[r(r—1)—5r +25] t’ 
=[r?—6r4+25]t’ 
The roots of the equation r* —6r +25=0 are 


6+ /36— 100 


2 


=3+4i 
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so that 
o(t) — por4i _ 13¢4 
— p3e(n1)4i = peei4in t) 


= ¢3[cos(41n )+ isin(41n t)] 
is a complex-valued solution of (11). Consequently, 


y(t) =Re{$(t)} =2?cos(41n r) 
and 
y,(t) = Im{$(t)} =2? sin(4Inr) 
are two independent solutions of (11), and the general solution is 
y(t) =t?[ c,cos(4Int)+c,sin(4Int)],  1>0. 


Let us now return to the case t< 0. One difficulty is that ¢” may not be 
defined if t is negative. For example, (— 1)!/? equals i, which is imaginary. A 
second difficulty is that Inz is not defined for negative t. We overcome both 
of these difficulties with the following clever change of variable. Set 


t=—-x, x>0, 
and let y= u(x), x > 0. Observe, from the chain rule, that 


dy _ du dx du 


dt dxdt ax 
and 


fyi (_ a) £(- du) oe d*u 
dt2 at dx dx dx} dt — qx2 


Thus, we can rewrite (2) in the form 


(— x £4 + a(-x)(-“) +Au=0 
or 
2 Ms By =0 x>0 (12) 
2 dx 


But Equation (12) is exactly the same as (2) with ¢ replaced by x and y 
replaced by u. Hence, Equation (12) has solutions of the form 


CX + 05x" 
u(x) = 4(¢,+¢,Inx)x” (13) 
[ c;cos(win x) + c,sin(pln x)] x 
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depending on whether (a— 1)? —48 is positive, zero, or negative. Observe 
now that 


x=-t=|t| 
for negative t. Thus, for negative t, the solutions of (2) have one of the forms 
c,|t|" +c |t|” 
[e, + coln|e|]|e|” 
[ ccos( win |2}) + cysin( win |e] )] |2|* 


Remark. The equation 


(1-1 2X + a(t 19) 7 + By= 0 (14) 


is also an Euler equation, with a singularity at ¢ =, instead of ¢ = 0. In this 
case we look for solutions of the form (t—1f,)’. Alternately, we can reduce 
(14) to (2) by the change of variable x =t — fp. 


EXERCISES 


In Problems 1-8, find the general solution of the given equation. 


1. ¢?y"+5ty’-Sy =0 2. 2t7y"+3ty’- y=0 

3. (t-1)*y"”-At—-l)y’+2y=0 4. t?y"+3ty’+ y=0 

5. t7y"”—ty’+ y=0 6. (t—2)*y"+5(t 2) y+ 4y =0 
7. t°y"+ty’+ y=0 8. ty" 4+3r'/+2y=0 

9 


. Solve the initial-value problem 
t*y”—ty’—2y=0; y(1)=0, y(I)=1 
on the interval 0<t<oo. 
10. Solve the initial-value problem 
t?y”—3ty’+4y=0; y(1I)=1, y'(1)=0 
on the interval 0<t<oo. 


11. Use the method of reduction of order to show that y,(t) =7"In¢ in the case of 
equal roots. 


2.8.2 Regular singular points, the method of Frobenius 


Our goal now is to find a class of singular differential equations which is 
more general than the Euler equation 


d* d 
poo tata, + By=0 (1) 
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but which is also solvable by analytical techniques. To this end we rewrite 
(1) in the form 


d*y ady  B _ 
atta? 2? (2) 


A very natural en of (2) is the equation 


L{y]= > 2+ (i) (t+ a(t)y=0 (3) 
where p(t) and q(t) can be O anded in series of the form 


P 
P(t)= "+ p+ pot + pst? + _ 


q(t) = "P+ tant ast t gat? +: (4) 


Definition. The equation (3) is said to have a regular singular point at t = 0 
if p(t) and q(t) have series expansions of the form (4). Equivalently, ¢ = 0 
is a regular singular point of (3) if the functions ¢p(t) and t’g(t) are 
analytic at ¢ = 0. Equation (3) is said to have a regular singular point at 
t = ty if the functions (t — ty) p(t) and (t — ty)’q(t) are analytic at ¢ = fo. 
A singular point of (3) which is not regular is called irregular. 


Example 1. Classify the singular points of Bessel’s equation of order vy 


2 d*y 

pe 1G +(e - v?)y=0, (5) 
where pv is a constant. 

Solution. Here P(t)=t? vanishes at t= 0. Hence, t = 0 is the only singular 
point of (5). Dividing both sides of (5) by ¢? gives 


d*y 1 dy pr \ 
dt , mal “|r=0 


Observe that 

tp(t)=1 and ¢’g(t)=t?—v? 
are both analytic at t=0. Hence Bessel’s equation of order v has a regular 
singular point at t= 0. 


Example 2. Classify the singular points of the Legendre equation 


(- pe 21D +a(a+1)y=0 (6) 


where a is a constant. 
Solution. Since 1—t* vanishes when t=1 and —1, we see that (6) is 
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singular at t= +1. Dividing both sides of (6) by 1— 1? gives 
d*y 2r dy _ (atl) y= 


a 
dt? 1—1? at 1—?? 


Observe that 


(1—1)p(t) = 21 


and 


(t—1)?¢(t) = (a1) EE iy ~a(at+1)i—t 


es 


are analytic at ¢=1. Similarly, both (¢ +1)p(t) and (t + 1)’q(t) are analytic 
at t= —1. Hence, t=1 and t= —1 are regular singular points of (6). 


Example 3. Show that ¢ = 0 is an irregular singular point of the equation 
24Y DK 
7? +3— HTD =0. (7) 
Solution. Dividing through by ¢? gives 
d*y 3d 1 | 
Wi + 2 di ty =0. 
In this case, the function 


n= 3)=3 


t 


is not analytic at t= 0. Hence ¢ = 0 is an irregular singular point of (7). 


We return now to the “suave 


Lly]=<4 2+ p(t) pt q(t)y=0 (8) 


where t=O is a regular seclas point. For simplicity, we will restrict 
ourselves to the interval t>0. Multiplying (8) through by #7 gives the 
equivalent equation 


Lly]=P £2 + a(i(1)) 2 + Paley =0. ) 


We can view Equation (9) as being obtained from (1) by adding higher 
powers of ¢ to the coefficients a and £. This suggests that we might be able 
to obtain solutions of (9) by adding terms of the form t’*!, ¢’*7,... to the 
solutions ¢” of (1). Specifically, we will try to obtain solutions of (9) of the 
form 


ore) ore) 
y(tv= Daerr=t" ¥ a,t". 
n=0 n 


=0 
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Example 4. Find two linearly independent solutions of the equation 


d? d 
Lly]=2— 2 + 9 +9 =0, 0<t<o. (10) 


Solution. Let 


y(t)= > a,t"*’, ay #0. 
n=0 


Computing 
y(t)= 3d (ntr)a,t 
n=0 

and 

[o.@) 

y(t)h= DX (ntr)\(ntr—-1)a,t? 

n=0 

we see that 


L[y]= if ¥ (n +r)(n+r—l)a,t"™' 


(o @) (o @) 
+ ¥ (ntr)a,t” '+ > a 
n=O n=0Q 


= if > (n+r)(nt+r—-1)a,t"' 
n=0 


oe) o @) 
+ > (ntr)a,t"™ '+ > oat 
n=O n=2 


=[2r(r—1)ay t+ ray|t’?-'+[2(1+r)ra,+(1t+r)a,]e’ 


(o @) 
+> [2(n +r)(n+r—-l)a,+(nt+r)a,+ a,—»\t"t”! 
n=2 


Setting the coefficients of each power of t equal to zero gives 


Gi) 2r(r—l)ayg + rag=rQ2r—l1)ay =), 
(i) 207+ ))ra,+(rt+la,=(rt+)Qrt+a, =), 
and 
(ii) 2n+r)\n+r—l)a,+(nt+r)a,=(n+r)[2(n+r)—lla,=—a,_,, 
n= 2. 


The first equation determines r; it implies that r=0 or r=4. The second 
equation then forces a, to be zero, and the third equation determines a,, for 
n=2. 
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(i) r=0. In this case, the recurrence formula (iii) is 
— an-2 

a LL 

n ~ n(n —1)’ 


Since a,=0, we see that all of the odd coefficients are zero. The even 
coefficients are determined from the relations 


n=2. 


92 923° 9447 9-43-77? 9 61 2-4-6-3-7-11 
and so on. Setting a, =1, we see that 
t? r4 Cy" 
t)=1-~- + ~~ —}4 y —Le 
vit) 23° 243-7— b> , 2'n!3-7 +++ (4n—1) 


is one solution of (10). It is easily verified, using the Cauchy ratio test, that 
this series converges for all ¢. 


(ii) r=4. In this case, the recurrence formula (iii) is 
~ an ~ ayn—2 


“(nt 8) 2(n4+ 4-1] nQn+1)’ 


Again, all of the odd coefficients are zero. The even coefficients are 
determined from the relations 

74 74,_ a4 — ap 

2-57 94 4-9 ~ 2-4:5-9° 96" 6-13 2-4-6-5-9-13 
and so on. Setting a, =1, we see that 


a,— 


14 
— 41/2 poo a, 
yalt)=t oe 2-5 ° 2-4-5-9 | 


1+ 3 (“yr 


— 71/2 7 a 
 2"n!15-9-++(4n+1) 


is a second solution of (10) on the interval 0<t<oo. 


Remark. Multiplying both sides of (10) by ¢ gives 


924) 4 ¥ 


2,,— 
” arya. 


This equation can be viewed as a generalization of the Euler equation 


d*y , dy _ 

2 

2t 2 tt dt 0. (11) 

Equation (11) has solutions of the form t’, where 
2r(r—1)+r=0 


This equation is equivalent to Equation (i) which determined r for the 
solutions of (10). 
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Let us now see whether our technique, which is known as the method of 
Frobenius, works in general for Equation (9). (We will assume throughout 
this section that t>0.) By assumption, this equation can be written in the 
form 


d? d 
Lly]=? <5 +¢L p+ pit + prt? + JZ 4 [gg tqytt+qnt? + +» |y=0. 


12 dt 
Set 
CO 
y(t)= > a,t"t’, with a, 40. 
n=0 
Computing 
oO 
y(t)= ¥ (n+r)a,er 
n=0 
and 
co 
y (a= > (ntr)\(nt+r—1)a,t"*"? 
n=0 
we see that 


LL" » (nt Nlntr— Tage | 


n=0 


+( 3 anit [ 5 at")} 


Multiplying through and collecting terms gives 


7 Ms 


‘Pat Px + rant" 


L[ y]=[r(r-1)+ por + qo] aot” 
+ ([+ r)r + po(l+r)+ go]; (rp, + 4y)ao}t" 


+ [LlntNrer—D+ paint) ala 


n—-]| 


+ > [eet ayaa” 
k=0 
toon. 


This expression can be simplified if we set 
F(r)=r(r—-1)+ por + qQ. (12) 
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Then, 
Ll y]=a,F(r)t'+[a, FU +r)+(rp,+4q,)aoltit’+ --- 


n—-1 


+a,F(n+ pire] > (A+r) pe + dnc ae 
k=0 


+... 


Setting the coefficient of each power of t equal to zero gives 


F(r)=r(r—1)+ por + qo =9 (13) 
and 
n—1 
F(ntr)a,=— > [(kK+r)p,-~+4n—-xlay, n=. (14) 
k=0 


Equation (13) is called the indicial equation of (9). It is a quadratic 
equation in r, and its roots determine the two possible values r, and r, of r 
for which there may be solutions of (9) of the form 


10.8) 
S at??? 
n=0 
Note that the indicial equation (13) is exactly the equation we would obtain 
in looking for solutions t” of the Euler equation 


ay 


dy 
ye | Pot q+ 40 = 0. 


Equation (14) shows that, in general, a, depends on r and all the preceding 
coefficients dy, a,,...,a,—,. We can solve it recursively for a, provided that 
F+t+r), F2+,1r),...,F(n +r) are not zero. Observe though that if F(n +r) 
= 0 for some positive integer n, then n+ r is a root of the indicial equation 
(13). Consequently, if (13) has two real roots r,, 7, with r, >r, and r, — 7, not 
an integer, then Equation (9) has two solutions of the form 
oO o6) 
y(t)=t" XY a,(r)t", w(t)=t? DY a,(7)e", 
n=0 n=0 

and these solutions can be shown to converge wherever tp(t) and ¢7q(t) 
both converge. 


Remark. We have introduced the notation a,(r,) and a,(7,) to emphasize 
that a, is determined after we choose r =r, or 1/5. 


Example 5. Find the general solution of the equation 


dy 


yo toy = 0. (15) 


Lly]=452 +39 
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Solution. Equation (15) has a regular singular point at t= 0 since 


wp(t)=4 and t’q(t)=it 
are both analytic at t=0. Set 


(oa) 
y(t)= ¥ a,t"t’, a, #0. 
n=0 


Computing 
io @) 
y(t)= > (ntr)a,t7! 
n=O 
and 
1°. 6) 
y (t= 3 (nt+r)(n+r—l)a,t"?’~? 
n=0 
we see that 


io 6) 
Ll y]=4 3 (ntr)(nt+r—1)a,t"*" 7! 
n=0 
oO (6) 
+3 > (nt+r)a,t"*" '+3 3 a,t"t" 
n=0 n=0 


[4(n+r)(nt+r—1)+3(n4+r)]a,t77 1+ 3S 3a, 7b. 
0 n=1 


I 
li M48 


n 
Setting the sum of coefficients of like powers of t equal to zero gives 


4r(r—1)+3r =4r? —r=r(4r—1)=0 (16) 


and 


[4(n+r)(n+r—1)+3(n+r)]a,=(n+r)[4(n+r)—l]a,=—3a,_,, 
n>1. (17) 
Equation (16) is the indicial equation, and it implies that r=0 or r=}. 


Since these roots do not differ by an integer, we can find two solutions of 
(15) of the form 


co 
> at"? 
n=0 
with a, determined from (17). 
r =Q. In this case the recurrence relation (17) reduces to 


a= —3 an—| _ —3a,—| 
" 4n(n—1)+3n n(4n—-1)° 
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Setting a, =1 gives 


_ —3a, _ ] 
a,=-l, a, — 2.7 =3577> 
73 3-71 SOTA’ 

_ —3a3 3 1 
94 = 495 2 95-47 1115” 


and, in general, 


1= (—1)"3"7! 
n ~~ nl7-11-15+--(4n—1)" 
Hence, 
s (—1)"3"7! 
= of” 18 
WC= Bas (ana) (8) 


is one solution of (15). It is easily seen, using the Cauchy ratio test, that 
y,(t) converges for all ¢. Hence y,(t) is an analytic solution of (15). 


r=4. In this case the recurrence relation (17) reduces to 


—34n-1 734n-1 1 


Setting a) =1 gives 
_=3 3? 3 
~ “3 9-3-5-9- 13’ 


34 


44 ~ 9.3-4-5-9-13-17° 


Proceeding inductively, we see that 


1= (—1)"3" 
" n!5-9-13 +++ (4n +1)" 


Hence, 


a —1)"3" 
1)=1t!/4 ee Gt) 
volt) 2 5-9-13-- (nF) 


is a second solution of (15). It can easily be shown, using the Cauchy ratio 
test, that this solution converges for all positive t. Note, however, that y,(t) 
is not differentiable at ¢ = 0. 

The method of Frobenius hits a snag in two separate instances. The first 
instance occurs when the indicial equation (13) has equal roots r,=7r,. In 
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this case we can only find one solution of (9) of the form 
oO 
y(e)=t" BY a,c”. 
n=0 


In the next section we will prove that (9) has a second solution y,(t) of the 
form 


co 
y,(t)=y (tine +1" XY 6,0 
n=0 

and show how to compute the coefficients b,. The computation of the b, is 
usually a very formidable problem. We wish to point out here, though, that 
in many physical applications the solution y,(7) is rejected on the grounds 
that it is singular. Thus, it often suffices to find y,(t) alone. It is also 
possible to find a second solution y,(t) by the method of reduction of order, 
but this too is usually very cumbersome. 

The second snag in the method of Frobenius occurs when the roots r,, 7, 
of the indicial equation differ by a positive integer. Suppose that r, = 7, + N, 
where AN is a positive integer. In this case, we can find one solution of the 
form 


(6) 
y(t) =0" > a,t". 
n=0 
However, it may not be possible to find a second solution y,(t) of the form 
oO 
y(t)=1?  b,t”. 
n=0 


This is because F(r, +n)=0 when n=N. Thus, the left hand side of (14) 
becomes 
N-1 
O-ay=— [(k +1) Pye In—a] Oe (19) 
k=0 
when n= N. This equation cannot be satisfied for any choice of ay, if 


N-1 
> [(k + ry) Pn—z + dn—x a, #0. 
k=0 


In this case (see Section 2.8.3), Equation (9) has a second solution of the 
form 


y(t) = y,(¢)nt+t? > 5,2" 
n=0 


where again, the computation of the b, is a formidable problem. 


212 


2.8 Series solutions 


On the other hand, if the sum on the right hand side of (19) vanishes, 
then a,, is arbitrary, and we can obtain a second solution of the form 


ee) 
y(t)=t? > bt". 
n=0 


We illustrate this situation with the following example. 
Example 6. Find two solutions of Bessel’s equation of order +, 


ple +B a(e —1/4)y=0, 0<t<oo. (20) 
Solution. This equation has a regular singular point at ¢ = 0 since 

ip(t)=1 and fr’q(t)=21?-4 
are both analytic at t= 0. Set 


y(t)= > a,t"*’, a, #0. 


Computing 
[o @) 
y(t)= >) (n+r)a,t"*"—! 
n=O 
and 
[o@) 
y(th= 3B (ntr)\(n+r—-1)a,t"*"-? 
n=0 
we see that 
oO 
L= 3 (ne nintr—nayers S (meray 
n=O n=0 


+ 5 a prtr+2 _ i S a, pntr 
n 


n=0 

io @) oe) 

= Db [(n4+r)(ntr—-1)4+(ntr)—1Ja,77+ 3 a,_5t"*". 
=0 n=2 


Setting the sum of coefficients of like powers of t equal to zero gives 
F(r)ay=[r(r -1) +r —$]ay =(r? —4)ay =0 (i 
F(i+r)a,=[(+r)r+(itr)- tJa,=[(1+ ry —4]a,=0 (ii) 
and 
F(nt+r)a,=|(n+r)—4]a,=—a,_», n>2 (iii) 


Equation (i) is the indicial equation, and it implies that r, =4, 7, =—+4. 
r,= 7: Set ay=1. Equation (ii) forces a, to be zero, and the recurrence 
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relation (111) implies that 


~~ Gn-2 — aAn—2 
a, = =, 2 ?. 
" F(n+4)  n(n+1) 
This, in turn, implies that all the odd coefficients a,,a,,..., are zero, and 
the even coefficients are given by 
—74 Tl tL 
“2° 9-32-37 
— ay ] _ i 
“44-5 2-3-4551 
L= ag —] tL 
6 6-7 -2-3-4:5-6-7— 7! 


and so on. Proceeding inductively, we see that 
eG 
2n  (2n)'(2n +1) 


Hence 


5! 7! 
is one solution of (20). This solution can be rewritten in the form 


uz 3 
(== (1-5 


7 re 78 
y(n=e?(i-5+ tee] 


en 
+o G4 


1. 
= — sinl. 


yt 


ry = — 3: Set ay =1. Since 1+ 7, =} is also a root of the indicial equation, 
we could, conceivably, run into trouble when trying to solve for aj. 
However, Equation (ii) is automatically satisfied, regardless of the value of 
a,. We will set a, = 0. (A nonzero value of a, will just reproduce a multiple 
of y,(t)). The recurrence relation (iii) becomes 
a, = = en = n>2. 
(n—+) 1 n“—-n n(n—1) 


All the odd coefficients are again zero, and the even coefficients are 


74 
“2 9. OT 
_— 7a, | 
“44.3 4! 
74, _ | 
“66-5 6! 
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and so on. Proceeding inductively, we see that 


i 
2n  (2n)1° 


Hence, 


—;,—1/2 ne ere Se 
yy(t)=t 2 4! 6! 


l 
= — cost 


t 


is a second solution of (20). 


Remark 1. If 7 is a complex root of the indicial equation, then 
o 2) 
y(t)=t" S a,t” 
n=0 


is a complex-valued solution of (9). It is easily verified in this case that both 
the real and imaginary parts of y(t) are real-valued solutions of (9). 


Remark 2. We must set 
(o@) 
y(t)=|e)" & a,” 
n=0 


if we want to solve (9) on an interval where ¢ is negative. The proof is 
exactly analogous to the proof for the Euler equation in Section 2.8.1, and is 
left as an exercise for the reader. 

We summarize the results of this section in the following theorem. 


Theorem 8. Consider the differential equation (9) where t=0 is a regular 
singular point. Then, the functions tp(t) and t?g(t) are analytic at t=0 
with power series expansions 


tp(t) = po + pit + pat? + oT ,t?q(t)= qo t+q,tt+ qt? + se 


which converge for |t|<p. Let r, and r, be the two roots of the indicial 
equation 

r(r—1)+ por +q) = 0 
with r, = r, if they are real. Then, Equation (9) has two linearly independent 


solutions y(t) and y,(t) on the interval 0<t< p of the following form: 
(a) Ifr, —1r, is not a positive integer, then 


0° 00 
y(t)=t" Sat”, y(t)=1? DS Bt”. 
n=0 n=0 
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(b) Ifr, =n, then 
oe) oO 
y(t)=e" XY a,t", y(t)=y(e)lnet+e" D bt”. 
n=0 n=0 
(c) Ifr,—1r, =N, a positive integer, then 
io @) ie 6) 
y(t)=t" > at", yy(t)=ay,(e)ine +e? J bt” 
n=0 n=0 


where the constant a may turn out to be zero. 


EXERCISES 


In each of Problems 1-6, determine whether the specified value of ¢ is a 
regular singular point of the given differential equation. 


1. 4(¢—2)*y"+ ty’+ y=0; t=0 2. t(t—2)*y"+ ny’ + y=0; t=2 


3. (sin t)y" + (cos t)y’+2y = 0; t=0 4. (e’—-l)y”+e'y’+ y=0; t=0 


5. (1—t?)y"/+ +y=0;t=-1 


l , 
sin(t +1)” 
6. ey” +(sint?)y’+ ty =0; t=0 


Find the general solution of each of the following equations. 


7. 2t?y"+3ty’-(1+ )y=0 8. 2ty”"+(1—-2t)y’— y=0 
9. 2ty"+(1+t)y’-2y=0 10. 277y”— ty’+U+by=0 
11. 4ty"+3y’—3y =0 12. 20°y"+(t?—ty’+y=0 


In each of Problems 13-18, find two independent solutions of the given 
equation. In each problem, the roots of the indicial equation differ by a 
positive integer, but two solutions exist of the form t”>%_,a,t”. 


13. t?y"— ty’—(t7 +3) y =0 14. ?y"+(t—?t’)y’— y=0 
15. ty”—(t? +2)y’+ ty =0 16. t?7y"+(t-—t?)y’—t=0 
17. ¢?y"+t(t+1l)y’— y=0 18. ty”—(44+ t)hy’+2y=0 


19. Consider the equation 
t?y"+(t?—3t)y’+3y =0 («) 


(a) Show that r=1 and r=3 are the two roots of the indicial equation of (*). 
(b) Find a power series solution of (*) of the form 


ie @) 
y(r)aer Dd at", ag=l. 
n=0 
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20. 


21. 


22. 


2.8 Series solutions 


(c) Show that y,(t) = te‘. 
(d) Show that (*) has no solution of the form 


ioe) 
t > bt". 
n=0 


(e) Find a second solution of (*) using the method of reduction of order. Leave 
your answer in integral form. 


Consider the equation 
t?y"+ ty’—(1+t)y=0. 


(a) Show that r= —1 and r=] are the two roots of the indicial equation. 
(b) Find one solution of the form 


00 
y(t)=t d a,t” 
n=0 


(c) Find a second solution using the method of reduction of order. 
Consider the equation 
ty" + ty’+2y=0. 


(a) Show that r=0 and r=1 are the two roots of the indicial equation. 
(b) Find one solution of the form 


ore) 
y(t) =t > a,t”. 
n=0 


(c) Find a second solution using the method of reduction of order. 
Consider the equation 
ty” +(1—¢t7)y’+4ty =0. 


(a) Show that r= 0 is a double root of the indicial equation. 
(b) Find one solution of the form y,(t) = 27_,)a,t”. 
(c) Find a second solution using the method of reduction of order. 


. Consider the Bessel equation of order zero 


ty" + ty’+t?y=0. 


(a) Show that r= 0 is a double root of the indicial equation. 
(b) Find one solution of the form 


t* 16 
a2 are 


#2 
w(al—~ + 


This solution is known as J)(t). 
(c) Find a second solution using the method of reduction of order. 
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25. 


26. 


27. 


28. 


29. 


Second-order linear differential equations 


Consider the Bessel equation of order v 

ty" + ty’+(1?—»*)y=0 
where p is real and positive. 
(a) Find a power series solution 


rs 


Jt) = v > a,t", ay=l. 
2"v! =o 


This function J,(t) is called the Bessel function of order v. 
(b) Find a second solution if 2» is not an integer. 


The differential equation 
ty”+(1—t)y’+Ay=0, Aconstant, 


is called the Laguerre differential equation. 

(a) Show that the indicial equation is r? = 0. 

(b) Find a solution y(t) of the Laguerre equation of the form 3°) a,t”. 
(c) Show that this solution reduces to a polynomial if A =n. 


The differential equation 
t((i-t)y”+[y-C(+atB)t] y’— apy =0 


where a, B, and y are constants, is known as the hypergeometric equation. 

(a) Show that ¢ = 0 is a regular singular point and that the roots of the indicial 
equation are 0 and 1-y. 

(b) Show that t=1 is also a regular singular point, and that the roots of the 
indicial equation are now 0 and y—a-—fB. 

(c) Assume that y is not an integer. Find two solutions y(t) and y,(t) of the 
hypergeometric equation of the form 


oO oe) 
y(t) = »» a,t”, y(ty=t7 > b,t". 
n=0 n=0 


(a) Show that the equation 
2(sint)y”+(1—-t)y’-2y=0 
has two solutions of the form 
00 oO 
n(y= Dat", y(ryae'’? DY be". 
n=0 n=0 
(b) Find the first 5 terms in these series expansions assuming that ay = by = 1. 


Let y(t) = u(t)+ iv(t) be a complex-valued solution of (3) with p(t) and q(t) 
real. Show that both u(t) and v(t) are real-valued solutions of (3). 


(a) Show that the indicial equation of 
ty’+t’+(1+t)y=0 ( *) 


has complex roots r= +i. 


218 


2.8 Series solutions 


(b) Show that (+) has 2 linearly independent solutions y(t) of the form 


fo.6) lo 6) 
y(t)=sin(int) § a,t"+cos(Int) > 5,2”. 
n=0 n=0 


2.8.3 Equal roots, and roots differing by an integer 


Equal roots. 
We run into trouble if the indicial equation has equal roots r, =r, because 
then the differential equation 


d*y dy _ 
P(t) + O(1) F + R(1)y=0 (1) 
has only one solution of the form 
1,6) 
y(t)=0t" ¥ a,t". (2) 
n=0 


The method of finding a second solution is very similar to the method used 
in finding a second solution of Euler’s equation, in the case of equal roots. 
Let us rewrite (2) in the form 


yty=ylry=t X a,(r)e" 
n=0 
to emphasize that the solution y(t) depends on our choice of r. Then (see 
Section 2.8.2) 
Ll y|(t,r)=ayF(r)t" 
ore) n—]| 
+ 2 feair)FC +r)+ (A +r) p,-x + In—] aerr 
n=l k=0 


We now think of r as a continuous variable and determine a, as a function 
of r by requiring that the coefficient of t"*’ be zero for n=>1. Thus 


n—-1 
— J [(k+r) pint In—el ax 
a,(r) = <9 —________ 
” F(n+r) 
With this choice of a,(r), we see that 
Ll y](t,7) =agF(r)t”. (3) 


In the case of equal roots, F(r)=(r—r,)*, so that (3) can be written in the 
form 


2 r 
Li y(t, r)=ag(r—7))'t". 
Since L[ y](¢, 7,) = 0, we obtain one solution 


ay + x enn". 


n= 


y(t) =2" 
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Observe now, that 


Arp len=1] 2 | 


0 r 
= 9, 40? — ryt 


=2a,(r—r,)t’ +a )(r—r,) (Int)t” 
also vanishes when r = re Thus 


y(t) = v(t, rN r=r 


a |S ay(ryer™ 


r=rTr, 


[o.@) (o-@) 
=> [a,(r,)¢7*" Int + > a(n err" 


0 n=O 


(oe) 
— y(t int + > ay(r, jt" 
n=0 


is a second solution of (1). 
Example 1. Find two solutions of Bessel’s equation of order zero 
»d*y _ 
= > 
Li y] en tag Thy 0, ¢t>0. (4) 
Solution. Set 


io 6) 
y(t)= y at" 
n=0 


Computing 
CO 
y(t)= > (nt+r)a,t"t! 
n=0 
and 
oe) 
y (t= > (ntr)\(ntr—-l1ya,tr? 
n=0 
we see that 


| 
I D48 1 M8 


o 


ed oe) 
Ll y] (n+r)\(ntr—l)a,t"*'+ SS (nt+rj)a,t77+ > atrtr*? 


n=0 n=O 


oO 


= 


oO 
(ntr)°a,t?t'+ > a,_yt”*’. 
n=2 


= 
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Setting the sums of like powers of t equal to zero gives 
(i) r*ay = F(r)ay =0 

(ii) (+ r)*a,= F(l+r)a,=0 

and 

(iii) (n+r)*a,= F(nt+r)a,=—a,_,,n=2. 

Equation (i) is the indicial equation, and it has equal roots r,=r, =0. 
Equation (ii) forces a, to be zero, and the recurrence relation (iii) says that 
_ a,—2 

(n+r) | 


Clearly, a, =a;=a,=--- =0. The even coefficients are given by 


—] 
ar)= an) (+r) 


] 
ar ; (24 r)(4+ r) 
and so on. Proceeding inductively, we see that 
—])" 
a,(r) = —__{-) _ 
(2+r)(44r)---(Qn+r) 


To determine y(t), we set r= 0. Then 


a,(0)= ? 
a,(0) = 


a,(r)= 


Iol 
92.42 i 74 (2!) 
—] —] 
a,.(0) = ——— = — 
ol )= 92. 42. 62 ~ 26317 
and in general 


ye fc 0" 
20) = 4?.--(2nP 22"(n1)? 


Hence, 
t? t4 1® 
t)=1-— + ——__ - —___ + 
yi ) 92 24.(21)° 26(31)° 
oo (— (=1)"t?" pon 
n=0 22"(n!) 


is one solution of (4). This solution is often referred to as the Bessel function 
of the first kind of order zero, and is denoted by J,(t). 
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To obtain a second solution of (4) we set 


y(t)=y(t)intt+ > a5,(0)t?". 
n=0 
To compute a4,(0), observe that 
ar) _ d 


_@ -2 0 -2 
a,,(r) dr Inja,,(r)|= 7 In2+r) (2n+r) 


= -2£ [in(2+ r)+in(4t+r)+---+In(2n+7)] 


=-2(—+ | get ). 


Qtr’ 44+r °° wntr 
Hence, 
/(9y-—2(141 
a,(0)=-2(5 +g +++ +52) aa9(0) 
| l 
=—{l+54 - + —}a,,(0) 
Setting 
11 l 
H,=lta+q4000 +5 (5) 
we see that 
Soy AAV" _ (-1)"*'H, 
2300) =? 2a ye 
22"(n!) 22"(n!) 
and thus 
00 (—1)""'H, ; 
y(t)=y(t)ne+ Y 
n=O 2 "(n!) 


is a second solution of (4) with H,, given by (5). 
Roots differing by a positive integer. Suppose that 7, andr, =rnt+N, Na 
positive integer, are the roots of the indicial equation. Then we can certainly 
find one solution of (1) of the form 
1°) 
y(t)=t" )) a, (r,)t". 
n=(0 
As we mentioned previously, it may not be possible to find a second 
solution of the form 
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In this case, Equation (1) will have a second solution of the form 


fl) 
y(t) = op V(t") => 


2 


io @) 
=ay(t)nt+ a’ (r,)t"*” 
n=0 


where a is a constant, and 


y(t,r)=t’ S a,(r)e" 


with 
ay=a,(r)=r—n. 


The proof of this result can be found in more advanced books on differen- 
tial equations. In Exercise 5, we develop a simple proof, using the method of 
reduction of order, to show why a logarithm term will be present. 


Remark. It is usually very difficult, and quite cumbersome, to obtain the 
second solution y,(t) when a logarithm term is present. Beginning and 
intermediate students are not expected, usually, to perform such calcula- 
tions. We have included several exercises for the more industrious students. 
In these problems, and in similar problems which occur in applications, it is 
often more than sufficient to find just the first few terms in the series 
expansion of y,(¢), and this can usually be accomplished using the method 
of reduction of order. 


EXERCISES 


In Problems 1 and 2, show that the roots of the indicial equation are equal, 
and find two independent solutions of the given equation. 


lL. "+ y’—4y=0 
2. t*y”’—t(l+t)y’+y=0 


3. (a) Show that r= —1 and r=] are the roots of the indicial equation for Bessel’s 
equation of order one 
d*y dy 
27 4 p= 4 (72-1) p=O. 
t i th (t1*—-1)y=0 


(b) Find a solution: 


J(t)=4t ¥ a,t",a)=1. 


n=0 
J,(t) is called the Bessel function of order one. 
(c) Find a second solution: 


l 2 (-1)"(4,,+ FA,_1) 
t)=—J,(t)Int+—J1—-— a gn 
rl =—AC)inee 2 22" (n—1)! 
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4. Consider the equation 
ty”+3y’—3y =0, t>0. 


(a) Show that r=0 and r= —2 are the roots of the indicial equation. 
(b) Find a solution 


oe) 
y(t) = > a,t”. 
n=0 


(c) Find a second solution 


y(t)=y(Oint +2 + p+ art ser + vee, 
t 


5. This exercise gives an alternate proof of some of the results of this section, using 
the method of reduction of order. 
(a) Let t=0 be a regular singular point of the equation 


ty” + tp(t)y’+ q(t)y=0 (i) 


Show that the substitution y = t’z reduces (i) to the equation 


122"+[2r+ p(t)]z’+[r(r-Nt+mp(t)+q(t)]z=0. (ii) 


(b) Let r be a root of the indicial equation. Show that (ii) has an analytic 
solution z,(t) = 2°_,a,t”. 
(c) Set z(t) =z,(t)v(t). Show that 


e fl2rtp(@)]/tdt 
2 
zi (t) 


(d) Suppose that r=7) is a double root of the indicial equation. Show that 
2% + Po =1, and conclude therefore that 


v(t) = fu(z) dt, whereu(t)= 


u 
u(t) = + uy + ut + vee, 


(e) Use the result in (d) to show that y,(t) has an In? term in the case of equal 
roots. 

(f) Suppose the roots of the indicial equation are ry and r,— N, N a positive 
integer. Show that 27) + pp =1+ N, and conclude therefore, that 


u(t)= sew i(t) 


where u(t) is analytic at t= 0. 

(g) Use the result in (f) to show that y,(t) has an In¢ term if the coefficient of 1% 
in the expansion of a(t) is nonzero. Show, in addition, that if this coefficient 
is zero, then 


_ L-Nn vO_-| 
v= + ve + + ott opt? + ss 
and y,(t) has no Int term. 
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2.9 The method of Laplace transforms 


In this section we describe a very different and extremely clever way of 
solving the initial-value problem 
d2 

a 462 +=f();  YO=%% ¥O=% (1) 
where a, b and c are constants. This method, which is known as the 
method of Laplace transforms, is especially useful in two cases which arise 
quite often in applications. The first case is when f(t) is a discontinuous 
function of time. The second case is when f(t) is zero except for a very 
short time interval in which it is very large. 

To put the method of Laplace transforms into proper perspective, we 
consider the following hypothetical situation. Suppose that we want to 
multiply the numbers 3.163 and 16.38 together, but that we have forgotten 
completely how to multiply. We only remember how to add. Being good 
mathematicians, we ask ourselves the following question. 


Question: Is it possible to reduce the problem of multiplying the two num- 
bers 3.163 and 16.38 together to the simpler problem of adding two num- 
bers together? 


The answer to this question, of course, is yes, and is obtained as follows. 
First, we consult our logarithm tables and find that 1n3.163 = 1.15152094, 
and In 16.38 = 2.79606108. Then, we add these two numbers together to 
yield 3.94758202. Finally, we consult our anti-logarithm tables and find 
that 3.94758202 = 1n51.80994. Hence, we conclude that 3.163 x 16.38 = 
51.80994. 

The key point in this analysis is that the operation of multiplication is 
replaced by the simpler operation of addition when we work with the loga- 
rithms of numbers, rather than with the numbers themselves. We represent 
this schematically in Table 1. In the method to be discussed below, the un- 
known function y(t) will be replaced by a new function Y(s), known as the 
Laplace transform of y(t). This association will have the property that y’(Z) 
will be replaced by sY(s)—y(0). Thus, the operation of differentiation with 
respect to ¢ will be replaced, essentially, by the operation of multiplication 
with respect to s. In this manner, we will replace the initial-value problem 
(1) by an algebraic equation which can be solved explicitly for Y(s). Once 
we know Y(s), we can consult our “anti-Laplace transform” tables and re- 
cover y(f). 


Table | 


Ina 


Inb 
Ina+Inb 
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We begin with the definition of the Laplace transform. 


Definition. Let f(t) be defined for 0<t< oo. The Laplace transform of 
f(2), which is denoted by F(s), or £{ f(d}, is given by the formula 


F(s)=£{ f()} = f “ef (t)dt (2) 


where 
[Pes(at= tim f Me-"f(t)dt. 
0 Ao JQ 


Example 1. Compute the Laplace transform of the function f(t)=1. 
Solution. From (2), 


C{s()}= fim fe" dr= lim 


A-c AY 
-|é s>0 
=J 5 
co, s<O0 


Example 2. Compute the Laplace transform of the function e”. 
Solution. From (2), 


l—e~4 


A (a—s)A _ 
P{e"}= lim f e “e*dt= lim et 
A> 4/9 Ae _ 
l 
-| , soa 
=, s—-a . 
CO, S<a 


Example 3. Compute the Laplace transform of the functions coswt and 
sin wt. 
Solution. From (2), 


io @) ioe) 
£{cosw} = f e-“coswtdt and {sinwr} = [ e~ “sin wt dt. 
0 0 


Now, observe that 


. . oo ; e A ; 
CP {cos wr} + if {sin wr} =[ ee! dr= lim f elio—s)t oy 
0 0 


A->o 
eliw—s)A_ | 
= lim ————— 
A>o  IlWw—-Ss 
L _ s+ iw s>0 
={ S—1W 52+ G52 ; 
undefined, s<0 
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Equating real and imaginary parts in thts equation gives 


C{coswt} = s>0. 


and &{sinwt}= —“— 
; and P{sinwr}= 


st + w?” 
Equation (2) associates with every function f(t) a new function, which 
we call F(s). As the notation £{ f(r)} suggests, the Laplace transform is an 


operator acting on functions. It is also a linear operator, since 
°° —st 
CicaS, (Qtoh(O}=/ e Leh (t) +c fy (t) | at 


=f e-"f, (t)dt+ cf ef (tat 
0 0 
=cl{ fi (t)}+c.£{ f(D}. 


It is to be noted, though, that whereas f(t) is defined for 0< t< oo, its 
Laplace transform is usually defined in a different interval. For example, 
the Laplace transform of e” is only defined for 2< 5 < oo, and the Laplace 
transform of e* is only defined for 8<s< oo. This is because the integral 
(2) will only exist, in general, if s is sufficiently large. 

One very serious difficulty with the definition (2) is that this integral 
may fail to exist for every value of s. This is the case, for example, if f(t) = 
e” (see Exercise 13). To guarantee that the Laplace transform of f(t) exists 
at least in some interval s > 5 9, we impose the following conditions on f(?). 


(i) The function f(t) is piecewise continuous. This means that f(t) has at 
most a finite number of discontinuities on any interval 0<¢< A, and 
both the limit from the right and the limit from the left of f exist at ev- 
ery point of discontinuity. In other words, f(t) has only a finite number 
of “jump discontinuities” in any finite interval. The graph of a typical 
piecewise continuous function f(¢) is described in Figure 1. 


__ 


Figure 1. Graph of a typical piecewise continuous function 
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(ii) The function f(¢) is of exponential order, that is, there exist constants 
M and c such that 


If()|< Me*, O<t<o. 


Lemma 1. /f f(t) is piecewise continuous and of exponential order, then its 
Laplace transform exists for all s sufficiently large. Specifically, if f(t) is 
piecewise continuous, and | f(t)|< Me”, then F(s) exists for s>c. 


We prove Lemma 1 with the aid of the following lemma from integral 
calculus, which we quote without proof. 
Lemma 2. Let g(t) be piecewise continuous. Then, the improper integral 


f **g(t) dt exists if f * g(t)|dt exists. To prove that this latter integral ex- 
ists, it suffices to show that there exists a constant K such that 


A 
[le@la<k 
for all A. 


Remark. Notice the similarity of Lemma 2 with the theorem of infinite 
series (see Appendix B) which states that the infinite series 5) a, converges 
if >) |a,| converges, and that >) |a,| converges if there exists a constant K 


such that |a,|+...+]|a,|< K for all n. 
We are now in a position to prove Lemma 1. 
PROOF OF LEMMA Il. Since f(f) is piecewise continuous, the integral 


J, “ens f(t)dt exists for all.A. To prove that this integral has a limit for all s 
sufficiently large, observe that 


A A 
[ le"s(|ae< Mf e “e" dt 
0 0 


— M [e994 -1] < M 
Cc S—-C 


~ 


for s>c. Consequently, by Lemma 2, the Laplace transform of f(t) exists 
for s>c. Thus, from here on, we tacitly assume that | f(‘)|< Me“, and 
s>c. CL 


The real usefulness of the Laplace transform in solving differential 
equations lies in the fact that the Laplace transform of f’(t) is very closely 
related to the Laplace transform of f(t). This is the content of the follow- 
ing important lemma. 
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Lemma 3. Let F(s)=£{f())}. Then 
PL f(D} =s£{f(4)} —fO) =sF(s)—f(O). 


Proor. The proof of Lemma 3 is very elementary; we just write down the 
formula for the Laplace transform of f’(t) and integrate by parts. To wit, 


e{F}= jim [Cesar 
—_— 1 — St A ' A — Sst 
= lim e FCO] + fim s [ ef (t)dt 
‘ A —st 
=—f(O)+s lim J e-“f(t),dt 
= ~f(0) + sF(s). Oo 


Our next step is to relate the Laplace transform of f”(?) to the Laplace 
transform of f(t). This is the content of Lemma 4. 


Lemma 4. Let F(s)=£{ f(t)}. Then, 
e(F"()} =F (s)—sf(0)-F'O). 
Proor. Using Lemma 3 twice, we see that 
eer} =sl( FO} -/'O) 
= s[ sF(s)-f]-F'O 
= 5°F (s) — sf (0)—f'(0). 0 
We have now developed all the machinery necessary to reduce the prob- 
lem of solving the initial-value problem 


d? 
a 462 +9=f();  »O=%o ¥O=% (3) 


to that of solving an algebraic equation. Let Y(s) and F(s) be the Laplace 


transforms of y(t) and f(t) respectively. Taking Laplace transforms of both 
sides of the differential equation gives 


Cf ay" (t) + by’(t)+ cy(t)} = F(s). 
By the linearity of the Laplace transform operator, 
Clay"(t)t+ by ()+o(t)}=al{y"(O}+b£{y'(D}+c£L{y(—}, 


229 


2 Second-order linear differential equations 


and from Lemmas 3 and 4 


L{y()}=s¥(s)—y, Lf y"()} =8°¥ (5)— 90 -Yo. 


Hence, 
al s’¥ (s) — Yo-Yo | + b[ sY (s) —yo| +cY(s)=F(s) 
and this algebraic equation implies that 


(as+d)yo aw F(s) 
ast+bs+ec  as*+bst+c as?+bste. 


Y(s)= (4) 

Equation (4) tells us the Laplace transform of the solution y(t) of (3). To 
find y(t), we must consult our anti, or inverse, Laplace transform tables. 
Now, just as Y(s) is expressed explicitly in terms of y(t); that is, Y(s) 


= frre est y(t)dt, we can write down an explicit formula for y(t). However, 


this formula, which is written symbolically as y(t)=2~'{ Y(s)}, involves 
an integration with respect to a complex variable, and this is beyond the 
scope of this book. Therefore, instead of using this formula, we will derive 
several elegant properties of the Laplace transform operator in the next 
section. These properties will enable us to invert many Laplace transforms 
by inspection; that is, by recognizing “which functions they are the 
Laplace transform of”. 


Example 4. Solve the initial-value problem 


d2 
Oo 39 yay ae y(O)=1, y(0)=0. 


Solution. Let Y(s)=£{y(t)}. Taking Laplace transforms of both sides of 
the differential equation gives 


s*¥ (s)—s—3[sY(s)-1] +2Y(s)= = 
and this implies that 
_ l s—3 
m= (s—3)(s?—3542) 52-3542 
= tH 
~ G=NGE-De=3) * =D(s=2)- ©) 


To find y(t), we expand each term on the right-hand side of (5) in partial 
fractions. Thus, we write 
1 __A B S 
(s—1)(s—2)(s—3) s-1s-2~ 


b 
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This implies that 
A(s—2)(s—3)+ B(s—1)(s—3) + C(s—1)(s—2)=1. (6) 
Setting s=1 in (6) gives A=}; setting s=2 gives B= — 1; and setting s=3 
gives C= +. Hence, 
l 11 1 1 
(s—1)(s—2)(s—3) 2s-1 s-2 25-3. 


Similarly, we write 


(s—1)(s—2) s-1l s-—2 


and this implies that 


D(s—2)+ E(s—1)=s-3. (7) 
Setting s=1 in (7) gives D=2, while setting s=2 gives E= — 1. Hence, 
vit wt ,1 1, 2 __! 
W(s)=5 5-7 5-2 725-3 15-1 5-2 
5 1 2 1 1 


+= , 
2s—-1 s-2 258-3 
Now, we recognize the first term as being the Laplace transform of 3e’. 
Similarly, we recognize the second and third terms as being the Laplace 
transforms of —2e”’ and $e, respectively. Therefore, 
Y (s)=£{ $e'—2e* + Le} 
so that 
y(t) =3e'—2e% + 5e™. 


Remark. We have cheated a little bit in this problem because there are ac- 
tually infinitely many functions whose Laplace transform is a given func- 
tion. For example, the Laplace transform of the function 


7(1)= 2e'—Je* + et, t#1,2, and 3 
0, t=1,2,3 


is also Y(s), since z(t) differs from y(t) at only three points.* However, 
there is only one continuous function y(t) whose Laplace transform is a 
given function Y(s), and it is in this sense that we write y(t)=£~'{ Y(s)}. 


We wish to emphasize that Example 4 is just by way of illustrating the 


method of Laplace transforms for solving initial-value problems. The best 
way of solving this particular initial-value problem is by the method of 


*If f(t)= g(t) except at a finite number of points, then J "t(t)dt= J "o(t)dt. 
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judicious guessing. However, even though it is longer to solve this particu- 
lar initial-value problem by the method of Laplace transforms, there is still 
something “nice and satisfying” about this method. If we had done this 
problem by the method of judicious guessing, we would have first com- 
puted a particular solution y(1)= +e’. Then, we would have found two in- 
dependent solutions e‘ and e* of the homogeneous equation, and we 
would have written 


y(t)=ce'+ ce" + Se" 


as the general solution of the differential equation. Finally, we would have 
computed c, =3 and c,= —2 from the initial conditions. What is unsatisfy- 
ing about this method is that we first had to find a// the solutions of the 
differential equation before we could find the specific solution y(t) which 
we were interested in. The method of Laplace transforms, on the other 
hand, enables us to find y(#) directly, without first finding all solutions of 
the differential equation. 


EXERCISES 


Determine the Laplace transform of each of the following functions. 


1. 2. t” 

3. e* cos bt 4. e“ sin bt 

5. cos at 6. sin? at 

7. sinatcosbt 8. ?’sint 

9. Given that f "e-* dx=WVa /2, find © {t—'/?}. Hint: Make the change of vari- 


able u= V1 in (2). 
Show that each of the following functions are of exponential order. 


10. 1” 11. sinat 12. eV! 


13. Show that e” does not possess a Laplace transform. Hint: Show that e”~* > e! 
for t>s+l. 


14. Suppose that f(t) is of exponential order. Show that F(s)=£{ f(2)} approaches 
0 as soo. 


Solve each of the following initial-value problems. 
15. y"—Sy’+4y=e"; y(O)=1, y'(O)=-1 
16. 2y”+y’-—y=e*; y(0)=2, y'(0)=0 


Find the Laplace transform of the solution of each of the following initial- 
value problems. 


17. y"+2y’+y=e'; yO)=1, y'(O)=3 
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18. y’+y=?sint; y(0)=y'(0)=0 
19. y”+3y’+7y=cost; y(0)=0, y(0)=2 
20. y"+y’+ty=P; y(0)=2, y’(0)=0 


21. Prove that all solutions y(t) of ay” + by’+cy=f(#) are of exponential order if 
f(é) is of exponential order. Hint: Show that all solutions of the homogeneous 
equation are of exponential order. Obtain a particular solution using the 
method of variation of parameters, and show that it, too, is of exponential 


order. 
22. Let F(s)=£{f(O}. Prove that 
a"f(t df"- (0 
e| 70 =s"F(s)—s"~'f(0)-...- — 


Hint: Try induction. 


23. Solve the initial-value problem 
yy” —6y" + 1ly’-6y =e"; (0) =y'(0) =y"(0) =0 
24. Solve the initial-value problem 


y"—3y'+2y=e"'; y(t) =1, y(t) =0 


by the method of Laplace transforms. Hint: Let $(t)=y(t+ to). 


2.10 Some useful properties of Laplace transforms 


In this section we derive several important properties of Laplace trans- 
forms. Using these properties, we will be able to compute the Laplace 
transform of most functions without performing tedious integrations, and 
to invert many Laplace transforms by inspection. 


Property 1. If £{ f(4)} = F(s), then 


£{-()} =< F(s). 


ProoF. By definition, F(s)= Joe f(idt. Differentiating both sides of 
this equation with respect to s gives 


&F()=4 pep (that 
_ (° ®(o-s _ [yes 
= geles(ae= [ep (at 
=£{ -sf(1)}. O 
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Property 1 states that the Laplace transform of the function — ¢f(f) is 
the derivative of the Laplace transform of f(t). Thus, if we know the 
Laplace transform F(s) of f(¢), then, we don’t have to perform a tedious 
integration to find the Laplace transform of tf(t); we need only differenti- 
ate F(s) and multiply by —1. 


Example 1. Compute the Laplace transform of te’. 
Solution. The Laplace transform of e’ is 1/(s—1). Hence, by Property 1, 
the Laplace transform of te’ is 


Example 2. Compute the Laplace transform of ¢)°. 
Solution. Using Property | thirteen times gives 


ype 1 _ (3)! 


dss gl4 | 


bd? 
ds'3 


E(t y=(-I GL a(- 


The main usefulness of Property 1 is in inverting Laplace transforms, as 
the following examples illustrate. 


Example 3. What function has Laplace transform —1/(s—2)’? 
Solution. Observe that 


Example 4. What function has Laplace transform —4s/(s*+4)?? 
Solution. Observe that 


__ 4s  _d_2 
(s?+ 4) ds +4 
Hence, by Property 1, 


a — Gea} | = —¢sin2t. 


Example 5. What function has Laplace transform 1 /(s — 4)*? 
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Solution. We recognize that 
l d7 1 1 


(s—4)° ~ ds? 2 s—4 


Hence, using Property 1 twice, we see that 


ana =£{ 52". 


Property 2. If F(s)=£{f(0}, then 
R{ef(t)} = F(s—a). 
ProoF. By definition, 


P{ews(D}= fo ertenf(idem f° ele f(t)ade 


= [eG /()dr= F(s—a). o 


Property 2 states that the Laplace transform of e” f(t) evaluated at the 
point s equals the Laplace transform of f(t) evaluated at the point (s— a). 
Thus, if we know the Laplace transform F(s) of f(t), then we don’t have to 
perform an integration to find the Laplace transform of e” f(t); we need 
only replace every s in F(s) by s—a. 


Example 6. Compute the Laplace transform of e* sint. 

Solution. The Laplace transform of sint is 1/(s?+1). Therefore, to com- 
pute the Laplace transform of e*’sint, we need only replace every s by 
s—3; that 1s, ; 


Rf{e*sins} = (s—3) 41 


The real usefulness of Property 2 is in inverting Laplace transforms, as 
the following examples illustrate. 


Example 7. What function g(t) has Laplace transform 
G(s)= i=! 
25+(s—7) 
Solution. Observe that 


S 
F(s)= 452 = {cos5r} 


and that G(s) is obtained from F(s) by replacing every s by s—7. Hence, 
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by Property 2, 
so 1 _ =£{e"cos5t}. 
(s—7)° +25 
Example 8. What function has Laplace transform 1 /(s*—4s+9)? 
Solution. One way of solving this problem is to expand 1/(s*—4s+9) in 
partial fractions. A much better way is to complete the square of s*—4s+ 
9. Thus, we write 
ee ee eS 
s?—4s+9  5*-45+44+(9-—4) (s—2)+5 


Now, 


=e{ 1 sinv5 «} 
s7+5 V5 
Hence, by Property 2, 
_ |i = —1— =| 4 etssinv5 1} 
s?—4s+9 (s—2)°+5 V5 


Example 9. What function has Laplace transform s/(s?—4s+9)? 
Solution. Observe that 
s —_  gs-2 2 
2 acho) 2a? 2g 
s*—-4s+9  (s—2)'+5 (s—2)°+5 
The function s/(s*+5) is the Laplace transform of cosV5 ¢. Therefore, by 
Property 2, 
—s-2 =2{e*cosV5 r}, 
(s—2)°+5 
and 
> = e{ e% cos v5 t+ 2 ot in V5 } 
s*—4s+9 V5 
In the previous section we showed that the Laplace transform is a linear 
operator; that is 


Llahidt+oh(d} = l{f()}p+al{A(p}. 


Thus, if we know the Laplace transforms F,(s) and F,(s), of f,(¢) and f,(2), 
then we don’t have to perform any integrations to find the Laplace trans- 
form of a linear combination of f,(¢) and f,(1); we need only take the same 
linear combination of F,(s) and F,(s). For example, two functions which 
appear quite often in the study of differential equations are the hyperbolic 
cosine and hyperbolic sine functions. These functions are defined by the 
equations 

et + e~u eu — ea 


sinh at = 


coshat = 5 , 5 
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Therefore, by the linearity of the Laplace transform, 


°{coshat}= . R few} +5 &{e*} 


_i/_! 1 |j__s 
2|s-a sta 5? — Q? 
and 


& {sinhat} = 5 £{em } - 5 &{e-*)} 


EXERCISES 


Use Properties 1 and 2 to find the Laplace transform of each of the follow- 
ing functions. 


1. 2” 2. t"e 3. tsinat 4. t*cosat 
5. 1°/2 (see Exercise 9, Section 2.9) 


6. Let F(s)=£{f(2)}, and suppose that f(t)/t has a limit as ¢ approaches zero. 
Prove that 


{ f(t)/t}= f °F (u)du. (*) 


(The assumption that f(1)/t has a limit as 0 guarantees that the integral on 
the right-hand side of (*) exists.) 


7. Use Equation (*) of Problem 6 to find the Laplace transform of each of the 


following functions: 
sint cosat — | et — 
a= om — © 


Find the inverse Laplace transform of each of the following functions. In 
several of these problems, it will be helpful to write the functions 


a,set+ Bi s?+y st, a,s?+Bisty 
P\(s)= ————.—_ and: p(s) = ——__,-———~ 
(as*+bs+c)(ds*+est+f) (as + b)(cs?+ds+e) 
in the simpler form 
As+B Cs+D A Cs+D 
5) = + "sand s)= oe” 
Pi(s) as*+bs+c  ds*+es+f P2(s) ast+b  ¢s*+ds+e 
s s?—5 
"(stay +b? "534452435 
| S 
10. ————_ 11, ———_—_ 
s(s7+4) s*—35—12 
1 358 
2, —-—_—--- > 13. 
(s? + a*)(s? + 5?) (s+1)° 
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15, ——_*—__ 
s(s+4) (s+1)(s2+1) 
] 
(st+1) 
17. Let F(s)=£{ f(#)}. Show that 
f()=— Fe" F(s)}. 


Thus, if we know how to invert F’(s), then we can also invert F(s). 


14. 


16. 


18. Use the result of Problem 17 to invert each of the following Laplace transforms 


(a) In( ste) (b) arc tan— (c) a — “ 


Solve each of the following initial-value problems by the method of 
Laplace transforms. 


19. y’+y=sint; y(O)=1, y(O)=2 

20. y’+y=tsint; y(O)=1, y’(O)=2 

21. y”—2y’+y=te'; y(0)=0, y’(0)=0 

22. y” —2y’+7y=sint; y(0)=0, y’(0)=0 
23. y"+y’+y=l+e 5; y(O)=3, y(O)=—5 


" — 2, 0<t<3 . = , = 
a. y"+y={2_, SSISFs yO=0,y@=0 


2.11 Differential equations with discontinuous 
right-hand sides 


In many applications, the right-hand side of the differential equation ay” + 
by’+ cy =f(t) has a jump discontinuity at one or more points. For exam- 
ple, a particle may be moving under the influence of a force f,(4), and 
suddenly, at time ¢,, an additional force f,(t) is applied to the particle. Such 
equations are often quite tedious and cumbersome to solve, using the 
methods developed in Sections 2.4 and 2.5. In this section we show how to 
handle such problems by the method of Laplace transforms. We begin by 
computing the Laplace transform of several simple discontinuous func- 
tions. 

The simplest example of a function with a single jump discontinuity is 
the function 


mida(% 95% 


This function, whose graph is given in Figure 1, is often called the unit step 
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C 
Figure 1. Graph of H,(#) 


function, or the Heaviside function. Its Laplace transform is 
ioe) co 
C{ H, (t)} =[ e-"H, (t)dt= f edt 
0 c 


; A e~S—e 3A 
= lim f e-"dt= lim —————— 
A> Jo A—>0o RY 


= , s>0. 
S 


Next, let f be any function defined on the interval 0<t< 00, and let g 
be the function obtained from f by moving the graph of f over c units to 
the right, as shown in Figure 2. More precisely, g(t)=0 for O<t<c, and 
g(t)=f(t—c) for t>c. For example, if c=2 then the value of g at t=7 is 
the value of f at t=5. A convenient analytical expression for g(¢) 1s 


8(t)= H(t) f(t—¢). 


The factor H,(t) makes g zero for 0< ¢t<c, and replacing the argument ¢ 
of f by t—c moves f over c units to the right. Since g(¢) is obtained in a 
simple manner from f(t), we would expect that its Laplace transform can 
also be obtained in a simple manner from the Laplace transform of f(t). 
This is indeed the case, as we now show. 


~~ 


Figure 2 
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Property 3. Let F(s)=£{f(d}. Then, 
C{H. (A) f(t—¢)} =e" OF (5). 
Proor. By definition, 
O(H.()f(t-0)}= [eH F(t—e) at 
= [Ces (to)at 


This integral suggests the substitution 


€=t-—c. 
Then, 
[oes (t-o)dt= [Me fae 
Cc 0 
— pos °° — st 
ee fe *f(@de 
=e “F(s). 
Hence, £{ H(i) f(t—0)} =e #2 (f(9}. oO 


Example 1. What function has Laplace transform e~°/s? 
Solution. We know that 1/s* is the Laplace transform of the function 1. 
Hence, by Property 3 
e —S 
“2 = C{ HA, (t)(t- 1)}. 
The graph of H,,(¢)(t— 1) is given in Figure 3. 


Figure 3. Graph of H,(t) (t-1) 
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Example 2. What function has Laplace transform e~**/(s?—2s—3)? 


Solution. Observe that 
l _~ J _ l 


s?—2s—3  s*-2s+1-4 (s—1)-2? 
Since 1/(s?—2?)= {4 sinh2r}, we conclude from Property 2 that 


1a e{Lersinn2r} 
(s—1)-2 v 
Consequently, from Property 3, 
— 3s 
aera =£| +H, (‘)e'~> sinh (1-3) 


Example 3. Let f(z) be the function which is ¢ for0<¢<1, and 0 for ¢> 1. 
Find the Laplace transform of f without performing any integrations. 
Solution. Observe that f(t) can be written in the form 


f(t)=t[ Ho (t)— Ay (1) ] =t— 1H, (0). 
Hence, from Property 1, 
R{ f(O}=L{t}-L{ tH, (2)} 


~l,de*_l_et_ 
s2? as s 52 s 52 


Example 4. Solve the initial-value problem 


d*y dy 1O<r<l; 0, 1<t<2; 
Ge at TW ALORYL 2<1<3; 0, 3<1<4; — ¥(0)=0, y'(0)=0 
f 1,4<1t<5; 0, 5<t<o. 


Solution. Let Y(s)=£{y(d)} and F(s)=£{ f(2)}. Taking Laplace trans- 
forms of both sides of the differential equation gives (s*—3s+2)Y(s)= 
F(s), so that 


F(s) F(s) 


MO" 3542 = DE=2) 


One way of computing F(s) is to write f(f) in the form 


f)=[A(Y- A ()]+ [AL (O)- 83 (0) ] + [Ha ()- Fs (1)]. 
Hence, by the linearity of the Laplace transform 
—Ss e725 e735 + e 4s e7>s 
S s S s s 


F(s)==- e 
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A second way of computing F(s) is to evaluate the integral 


[Penp(ndr= flentars frertart feat 


l—e~ —2s —3s — 4s —5s 
S S S 


Consequently, 
Y(s)= l—e +e 7% —e 84+ e788 -—e7 
s(s—1)(s—-2) 


Our next step is to expand 1/s(s—1)(s—2) in partial fractions; 1.e., we 
write 


l A B C 
= + + 
s(s—1)(s—2) s s-1l s-2 
This implies that 
A(s—1)(s—2)+ Bs(s—2)+ Cs(s—1)=1. (1) 


Setting s=0 in (1) gives A =; setting s=1 gives B= —1; and setting s=2 
gives C=. Thus, 

] 1 | l 1 
s(s-1)(s—2) 25 s-l 2 


Consequently, from Property 3, 


y(th=[z-e' + 36%] - Ay ()[p- e+ 30? | 
+ Hy (1)[3—e? + 30 | — Hy (1)[ 5-09 + 30 | 


+ H,(1)[4—e 9 + 40-9 | — He (t)[ 5 — e+ et“ ]. 


Remark. It is easily verified that the function 
5 _ el M4 sertt~n) 
and its derivative are both zero at t=n. Hence, both y(¢) and y’(¢) are con- 


tinuous functions of time, even though f(f) is discontinuous at t= 1, 2, 3, 4, 
and 5. More generally, both the solution y(t) of the initial-value problem 


d’y dy ’ , 
a tO TVS); V()=Yo Y'(bo) =% 
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and its derivative y’(t) are always continuous functions of time, if f(¢) is 
piecewise continuous. We will indicate the proof of this result in Section 
2.12. 


EXERCISES 


Find the solution of each of the following initial-value problems. 


1. 
2. 


3. 


y" +2y'+y=AUt—3)H(); y(O)=2, y'O)=1 
y” +y’ +y= H,()- H,,,(0); y(0)= l, y'(0)=0 


” = l, 0<t<4. = ; =— 
y” +4y ( 1>4 ; y(0)=3, y'(O)=-—2 


” _ sin t, O<t<r. = “m= 
4. y’ty={ sinh OStS* s yO=LyO=0 
nf cost, O<t<a7/2 | ny 
sy tya{ ee OS 2O=37O=—1 


7. 


; ytay'ty=| 


sin2t, O<t<a/2 . 1 w= 
0, n/2<t<oo° y(0)=1, y'(Q)=0 


ytyttya{t Freee? 2O=0,y'(0)=0 


2 
By'tye{h OSt<1s yO=0¥O=0 


0, l<t<oa 
0, O<t<l 


a ymayeyedt l<t<2; y(0)=0,y'0)=1 


10. 


11. 


0, 2<t<oo 
Find the Laplace transform of |sin¢|. Hint: Observe that 


oe) 
lsint}=sint+2 S H,,,(t) sin(t— 27). 


n=! 


Solve the initial-value problem of Example 4 by the method of judicious guess- 
ing. Hint: Find the general solution of the differential equation in each of the 
intervals O<¢<1, 1<t<2,2<t<3,3<t<4,4<1t<5, 5<t<o, and 
choose the arbitrary constants so that y(t) and y(t) are continuous at the 
points t= 1, 2, 3, 4, and S. 


2.12 The Dirac delta function 


In many physical and biological applications we are often confronted with 
an initial-value problem 


d* 
a 402 +o=K0;  rO=%e YO=%  — 
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Figure 1. The graph of a typical impulsive function f(t) 


where we do not know f(t) explicitly. Such problems usually arise when we 
are dealing with phenomena of an impulsive nature. In these situations, the 
only information we have about f(f) is that it is identically zero except for 
a very short time interval 4. <¢<1,, and that its integral over this time in- 
terval is a given number /,+#0. If J) is not very small, then f(t) will be 
quite large in the interval 4) <¢<t,. Such functions are called impulsive 
functions, and the graph of a typical f(f) is given in Figure 1. 

In the early 1930’s the Nobel Prize winning physicist P. A. M. Dirac de- 
veloped a very controversial method for dealing with impulsive functions. 
His method is based on the following argument. Let r, get closer and closer 
to fo. Then the function f(t)/J) approaches the function which is 0 for r# 
to, and oo for ¢=/%5, and whose integral over any interval containing /, is 1. 
We will denote this function, which is known as the Dirac delta function, 
by d(¢— %9). Of course, 5(¢— 5) is not an ordinary function. However, says 
Dirac, let us formally operate with 6(¢— 1,) as if it really were an ordinary 
function. Then, if we set f(t) = J)6(t— t)) in (1) and impose the condition 


| 
J 2(98(e-4)dr={ Bo) Hastost (2) 
a 0 otherwise 


for any continuous function g(f), we will always obtain the correct solution 
y(t). 


Remark. Equation (2) is certainly a very reasonable condition to impose 
on 6(t— fg). To see this, suppose that f(z) is an impulsive function which is 
positive for tj<¢<4,, zero otherwise, and whose integral over the interval 
[¢,¢,] is 1. For any continuous function g(P), 


[ amin, @(O[f09< gO F<_ max, 8()] FO. 


Consequently, - 
i “[ min, 800 [sas f"a(o Far i “[ max, 2(0)] fe 
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or 


min a(t)< f"8()f()at< max g(f). 
to to<t<t 


to<t<t, 


Thus, as 1,—>fo, i) "' 9(t) f(t) dt—>g(tp). 


Now, most mathematicians, of course, usually ridiculed this method. 
“How can you make believe that 6(¢—‘)) is an ordinary function if it is 
obviously not,” they asked. However, they never laughed too loud since 
Dirac and his followers always obtained the right answer. In the late 
1940’s, in one of the great success stories of mathematics, the French 
mathematician Laurent Schwartz succeeded in placing the delta function 
on a firm mathematical foundation. He accomplished this by enlarging the 
class of all functions so as to include the delta function. In this section we 
will first present a physical justification of the method of Dirac. Then we 
will illustrate how to solve the initial-value problem (1) by the method of 
Laplace transforms. Finally, we will indicate very briefly the “germ” of 
Laurent Schwartz’s brilliant idea. 


Physical justification of the method of Dirac. Newton’s second law of mo- 
tion is usually written in the form 


d 
me) =f) (3) 


where m is the mass of the particle, v is its velocity, and f(t) is the total 
force acting on the particle. The quantity mv is called the momentum of 
the particle. Integrating Equation (3) between ¢, and ¢, gives 


mv(t,;)— mv(ty) = J " (1) dt. 


This equation says that the change in momentum of the particle from time 


ty to time ¢, equals je f(t)dt. Thus, the physically important quantity is 


the integral of the force, which is known as the impulse imparted by the 
force, rather than the force itself. Now, we may assume that a >0 in Equa- 
tion (1), for otherwise we can multiply both sides of the equation by —1 to 
obtain a >0. In this case (see Section 2.6) we can view y(2), for tf < fp, as the 
position at time ¢ of a particle of mass a moving under the influence of the 
force — b(dy /dt)— cy. At time ¢, a force f(t) is applied to the particle, and 
this force acts over an extremely short time interval tg <¢<t,. Since the 
time interval is extremely small, we may assume that the position of the 
particle does not change while the force f(t) acts. Thus the sum result of 
the impulsive force f(7) is that the velocity of the particle jumps by an 
amount J,/a at time f9. In other words, y(‘) satisfies the initial-value prob- 
lem 

d? 

as +52 +ey=0; v(O)=y, y'(O)=y 
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for 0<t<%), and 


d? I 

a 46 +0=0;  r(W)=Zy YQ=at 2) 
for t > fg, where z, and zg are the position and velocity of the particle just 
before the impulsive force acts. It is clear, therefore, that any method 
which correctly takes into account the momentum J, transferred to the 
particle at time ¢, by the impulsive force f(t) must yield the correct answer. 
It is also clear that we always keep track of the momentum J, transferred 
to the particle by f(7) if we replace f(7) by [)5(t—¢t)) and obey Equation 
(2). Hence the method of Dirac will always yield the correct answer. 


Remark. We can now understand why any solution y(t) of the differential 


equation 
d*y _,& a | | 
a2 + b +cy=f(t), f(t) a piecewise continuous function, 
t 


is a continuous function of time even though f(t) 1s discontinuous. To wit, 
since the integral of a piecewise continuous function is continuous, we see 
that y’(¢), must vary continuously with time. Consequently, y(t) must also 
vary continuously with time. 


Solution of Equation (1) by the method of Laplace transforms. In order to 
solve the initial-value problem (1) by the method of Laplace transforms, 
we need only know the Laplace transform of 6(t— 9). This is obtained di- 
rectly from the definition of the Laplace transform and Equation (2), for 


£{8(t—t)} =f e-"8(t— to) dt= (for t, > 0). 
0 


Example 1. Find the solution of the initial-value problem 

d*y ,& ; 

We at A 38 DF 8-2); y(0)=1, y’(O)=1. 
Solution. Let Y(s)=£{y(t)}. Taking Laplace transforms of both sides of 
the differential equation gives 

s*Y —s—1—4(sY¥Y—1)+4Y=3e S+e-% 
or 
(s?-45+4)Y(s)=s—3+3e %+e-*. 
Consequently, 
_ —S —2s 
¥(s)= S34 32 4, en 
(s—2) (s-2)? (s-2) 
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Now, 1/(s—2)*=£ {te"}. Hence, 
3e75 es _ _ 
—— + = 2{3H, (t)(t—1)e— + A, (1)(t—2)e- 7}. 
Gop * Gop BHO) 2 (1)(t—2)e%-} 
To invert the first term of Y(s), observe that 


s—3 s—2 1 
ef 97} 8 1078), 
(s—2)? (s-2) (s-2) tego) 


Thus, y(t) =(1— t)e*+3H,(t)(t- le@- 9+ Hy(t)(t —2)e%¢-, 


It is instructive to do this problem the long way, that is, to find y(t) sep- 
arately in each of the intervals O< ¢<1,1<t<2and2<t<oo. For0<t< 
1, y(t) satisfies the initial-value problem 

d*y _ ,& 

We dt +4y=0; = y(O)=1, y(O)=1. 
The characteristic equation of this differential equation is r?—4r+4=0, 
whose roots are r,=r,=2. Hence, any solution y(t) must be of the form 
y(t)=(a,+a,t)e”". The constants a, and a, are determined from the initial 
conditions 


l=y(0)=a, and 1=y’(0)=2a,+a). 


Hence, a,=1, a,= —1 and y(t)=(1—de” for 0<t<1. Now y(1)=0 and 
y(1)= —e*. At time t=1 the derivative of y(t) is suddenly increased by 3. 
Consequently, for 1 < ¢<2, y(t) satisfies the initial-value problem 

d*y dy D 2 

72 —4— +4y=0;  y(I)=0, y'(I)=3-e°. 
Since the initial conditions are given at t=1, we write this solution in the 
form y(¢)=[b, + b,(¢— 1)Je*“~ ? (see Exercise 1). The constants b, and 5, 
are determined from the initial conditions 


O=y(1)=5, and 3-—e*=y’(1)=2b,+)). 

Thus, b, =0, b,=3—e? and y(t) =(3 — e*\(t — le*’-), 1< t<2. Now, 
y(2)=(3 — ee? and y’(2)=3(3—e7)e”. At time t=2 the derivative of y(t) 
is suddenly increased by 1. Consequently, for 2 < t< oo, y(t) satisfies the 
initial-value problem 

d? d 

<4 44y=0; — y(2)=e2(3-e2),_ y'(2)=143e2(3—-e). 

dt dt 
Hence y(t)=[c, +c,(t—2)]e*’-”. The constants c, and c, are determined 
from the equations 


e7(3—e*)=c, and 1+3e?(3—e7)=2c, +c. 
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Thus, 
c,=e*(3—e7), cp = 1+3e?(3— e*) —2e7(3 — e*?) = 1+ e7(3 — e”) 


and y(t)=[e2(3 — e*) +(1 + e7(3 — e”))(t — 2) Je, > 2. The reader should 
verify that this expression agrees with the expression obtained for y(t) by 
the method of Laplace transforms. 


Example 2. A particle of mass 1 1s attached to a spring dashpot mechanism. 
The stiffness constant of the spring is 1 N/ft and the drag force exerted by 
the dashpot mechanism on the particle is twice its velocity. At time ¢ = 0, 
when the particle is at rest, an external force e ‘is applied to the system. At 
time ¢ = 1, an additional force f(t) of very shor: duration is applied to the 
particle. This force imparts an impulse of 3 N-s to the particle. Find the 
position of the particle at any time ¢ greater than 1. 

Solution. Let y(t) be the distance of the particle from its equilibrium 
position. Then, y(t) satisfies the initial-value problem 


d*y a - 
a t2G tyme '+35(1— VD): y(0)=0, y’(0)=0. 


Let Y(s)=£{(t)}. Taking Laplace transforms of both sides of the dif- 
ferential equation gives 


3e ° 


aT, "G4 


(s? +2s+1)¥(s)=— > +375 or Y(s)= 


Since 
7 ~ ce se" —(t—- 
(s+1)° =0| 5 and +1? =32{ A, (t)(t—-le ( D\ 
we see that 


a Come 


y(y=t 


Consequently, y(t) = 5 t’e~'+3(t—Ne~~ for r>1. 


We conclude this section with a very brief description of Laurent 
Schwartz’s method for placing the delta function on a rigorous mathemati- 
cal foundation. The main step in his method is to rethink our notion of 
“function.” In Calculus, we are taught to recognize a function by its value 
at each time ¢. A much more subtle (and much more difficult) way of re- 
cognizing a function is by what it does to other functions. More precisely, 
let f be a piecewise continuous function defined for — 00 <t< oo. To each 
function ¢ which is infinitely often differentiable and which vanishes for |¢| 
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sufficiently large, we assign a number K[@] according to the formula 


Klo]= fo oo fat (5) 


As the notation suggests, K is an operator acting on functions. However, it 
differs from the operators introduced previously in that it associates a 
number, rather than a function, with . For this reason, we say that K[9] is 
a functional, rather than a function. Now, observe that the association ¢—> 
K[9] is a linear association, since 


K[ ¢,$, + C292] = [- (C1, + Cob2)(t) f(t) at 
=cif" a(S dtteaf olds oa 


=c,K{o,]+¢,K[ 4]. 


Hence every piecewise continuous function defines, through (5), a linear 
functional on the space of all infinitely often differentiable functions which 
vanish for |t| sufficiently large. 

Now consider the functional K[¢] defined by the relation K[¢]=$(¢p). 
K is a linear functional since 


K[ ¢,6, + Coby | = C11 (to) + C22 (lo) = c,K[$,]+¢.K[ 9]. 
To mimic (5), we write K symbolically in the form 


Ko]= [- 9(1)8(¢— to) (6) 


In this sense, 5(t¢— f,) is a “generalized function.” It is important to realize 
though, that we cannot speak of the value of 5(t—)) at any time ¢. The 


only meaningful quantity is the expression f ~ o(t)d(t— to) dt, and we 
must always assign the value $(¢)) to this expression. 

Admittedly, it is very difficult to think of a function in terms of the lin- 
ear functional (5) that it induces. The advantage to this way of thinking, 
though, is that it is now possible to assign a derivative to every piecewise 
continuous function and to every “generalized function.” To wit, suppose 


that f(t) is a differentiable function. Then f’(¢) induces the linear func- 
tional 


K'[e]= ff . $(t) f'() dt. (7) 


Integrating by parts and using the fact that ¢(/) vanishes for |t| sufficiently 
large, we see that 


K'el]=f" [-#¢@lya=x[-6'], (8) 
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Now, notice that the formula K’[¢]= K[—¢’] makes sense even if f(t) ts 
not differentiable. This motivates the following definition. 


Definition. To every linear functional K[] we assign the new linear func- 
tional K’[d] by the formula K'[¢]= K[—¢’]. The linear functional K [9] 
is called the derivative of K[@] since if K[¢] is induced by a differentia- 
ble function f(t) then K’[] is induced by f’(£). 


Finally, we observe from (8) that the derivative of the delta function 
6 (t— to) is the linear functional which assigns to each function ¢ the num- 
ber —¢'(t0), for if K[¢]=¢(t,) then K’[¢]= K{—¢']= —$(to). Thus, 


f o0)8'U~ te) dt = —4'(t) 
for all differentiable functions ¢$(f). 


EXERCISES 


1. Let a be a fixed constant. Show that every solution of the differential equation 
(d*y /dt*)+2a(dy /dt)+ a*y =0 can be written in the form 


y(t)=[c,+¢,(t-a)Je~ 7. 


2. Solve the initial-value problem (d7y /dt?)+4(dy /dt)+5y =f(1); YO)=1, y'(O)= 
0, where f(t) 1s an impulsive force which acts on the extremely short time inter- 


val 1<¢<1+7, and fsoa=2. 


3. (a) Solve the initial-value problem (dy /dt?) — 3(dy/dt)+2y =f(0); y(0)=1, 
y'(0)=0, where f(t) is an impulsive function which acts on the extremely 


short time interval 2< ¢<2+17, and f (Odi =—], 


(b) Solve the initial-value problem (d*y / dt?) —3(dy/dt)+2y =0; y(0)=1, y’(0) 
=0, on the interval 0< ¢<2. Compute z9=y(2) and zp=y’(2). Then solve 
the initial-value problem 


d’y_,W (Q)=2! 
a a 127 =9: y (2) = Zo, y'(2)=z9-1, 2<t<oo. 


Compare this solution with the solution of part (a). 


4. A particle of mass | is attached to a spring dashpot mechanism. The stiffness 
constant of the spring is 3 N/m and the drag force exerted on the particle by the 
dashpot mechanism is 4 times its velocity. At time ¢ = 0, the particle is stretched 4 
m from its equilibrium position. At time ¢ = 3 seconds, an impulsive force of very 
short duration is applied to the system. This force imparts an impulse of 2 N-s to 
the particle. Find the displacement of the particle from its equilibrium position. 
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In Exercises 5—7 solve the given initial-value problem. 
d? 


5, <a ty sint + 3(t— y(0)=0, y(0)=0 
6 a7 9 Lye =28(t-1)—8(t-2);_ y(0)=1, y'(0)=0 
‘ dt at 
d*y dy —t , 
7. me ta tyme +36(¢—3); y(0)=0, y’(0)=3 


8. (a) Solve the initial-value problem 


dy 


tee S 6(t-jn), _y(0)=y'(0)=0, 


jJ=0 
and show that 


sin f, neven 
y= or n odd 


in the interval nz <t<(n+1)z. 
(b) Solve the initial-value problem 


ay 
“J +y= ¥ 6(-2m),  yO=y'O=0 
j=0 


and show that y(¢t)=(n+ 1)sint¢ in the interval 2n7 << t <2(n+ 1). 
This example indicates why soldiers are instructed to break cadence when 
marching across a bridge. To wit, if the soldiers are in step with the natural 
frequency of the steel in the bridge, then a resonance situation of the type (b) 
may be set up. 


9. Let f(t) be the function which is + for ¢>¢%o, 0 for =f, and —1 for t< fo. Let 
K [$] be the linear functional 


Klel=f™ oo soat. 


Show that K’[¢]= K[-—9’]=¢(to). Thus, 5(t— 1%) may be viewed as the deriva- 
tive of f(t). 


2.13 The convolution integral 


Consider the initial-value problem 
2 


d“y dy 
az tbG teal; yO=% YO=%  — (H 


Let Y(s)=£{y(t)} and F(s)=L£{ f(H}. Taking Laplace transforms of 
both sides of the differential equation gives 


al s°Y (s)— syo—yo| + bl sY(s)—yo| +cY(s)=F(s) 
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and this implies that 


as+b a F(s) 
Y (s) = —————_- py, + —————_- yp, + —_—_————__. 
(s) as?+bstc>° ast+bste>° as?+bs+c 
Now, let 
_ b 
vil) as*+ bs+c 
and 


—p-1j_ @ 
y(t)=k | as*+ bs+c } 
Setting f(¢)=0, yp=1 and yj =0, we see that y,(Z) is the solution of the ho- 
mogeneous equation which satisfies the initial conditions y,(0)=1, y}(0)= 
0. Similarly, by setting f(¢)=0, y>=0 and yj=1, we see that y,(f) is the 
solution of the homogeneous equation which satisfies the initial conditions 
y(0) =0, y3(0)=1. This implies that 


F(s) | 


as? + bs+c 


yoyney 


is the particular solution of the nonhomogeneous equation which satisfies 
the initial conditions y(0)=0, (0) =0. Thus, the problem of finding a par- 
ticular solution y(t) of the nonhomogeneous equation is now reduced to 
the problem of finding the inverse Laplace transform of the function 
F(s)/(as? + bs +c). If we look carefully at this function, we see that it is 
the product of two Laplace transforms; that is 


——=2( 709) xe 


as*+bs+c a 


y(t) | 


It is natural to ask whether there is any simple relationship between y(t) 
and the functions f(t) and y,(t)/a. It would be nice, of course, if y(t) were 
the product of f(t) with y,(¢)/a, but this is obviously false. However, there 
is an extremely interesting way of combining two functions f and g 
together to form a new function f*g, which resembles multiplication, and 
for which 


Ei(fesy(t)}=L{ FD} X£{ a}. 


This combination of f and g appears quite often in applications, and is 
known as the convolution of f with g. 


Definition. The convolution (f* g)(t) of f with g is defined by the equation 


(feay=f “f(t=u) g(u) du. (2) 
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For example, if f(t)=sin2¢ and g(t)= e”, then 


(fea)(s)= f'sin2(e—u)e" du, 


The convolution operator + clearly bears some resemblance to the mul- 
tiplication operator since we multiply the value of f at the point t—wu by 
the value of g at the point u, and then integrate this product with respect to 
u. Therefore, it should not be too surprising to us that the convolution op- 
erator satisfies the following properties. 


Property 1. The convolution operator obeys the commutative law of multi- 
plication; that is, (f*g)(t)=(g*f)(2). 
Proor. By definition, 


(Seay()= f'S(—u) au) du 


Let us make the substitution ¢— u=s in this integral. Then, 
0 
(feay(t)=— f f(s) a(t—s)ds 
t 
= a(t) f(s) ds=(f)(2), o 


Property 2. The convolution operator satisfies the distributive law of mul- 
tiplication; that is, 


fe(gthy=fegtfeh. 
Proor. See Exercise 19. a 
Property 3. The convolution operator satisfies the associative law of multi- 
plication; that is, ({*g)*h=f*(g*h). 


PrRooFr. See Exercise 20. LJ 


Property 4. The convolution of any function f with the zero function is 
Zero. 


Proor. Obvious. LJ 


On the other hand, the convolution operator differs from the multiplica- 
tion operator in that f*14f and f*f#f?. Indeed, the convolution of a 
function f with itself may even be negative. 


Example 1. Compute the convolution of f(#) =? with g(t)=1. 
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Solution. From Property 1, 


P 


(feg)(t)=(g*f\(O= fo -u-du= 3° 


Example 2. Compute the convolution of f(#)=cost with itself, and show 
that it is not always positive. 
Solution. By definition, 


(F+f\(1) = [cos(t—u) cosudu 
0 
= f ‘(cost cos2u + sintsinu cosu) du 
0 


t t 
=cost [ AA COS2H dy + sint f sinucosudu 
0 0 


t . sin2t]. sin?t 
=cost| + 4 7 
_ tcost+sintcos*¢+sin°¢ 

2 
t cost +sint(cos”f + sin’) 

=. 
_ tcost+sint 

2 


This function, clearly, is negative for 
(2n+1)a<1¢<(2n+1)7+47, n=0,1,2,.... 


We now show that the Laplace transform of f*g is the product of the 
Laplace transform of f with the Laplace transform of g. 


Theorem 9. 2 {(f*g)()}=£{f()} x L{ g(d}. 
ProoF. By definition, 


° t 
E{(fes\(O}= [ ol f f(t- W) e(wa| 
0 
This iterated integral equals the double integral 
ffem#- u) g(u)dudt 
R 


where R is the triangular region described in Figure 1. Integrating first 
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ey 
t 
Figure 1 
with respect to ¢, instead of u, gives 
i(f+8)(2)} -{ g(w)| fe" f(t—u | 
0 u 
Setting t—u=, we see that 
f "ef (t—u)dt= f “e~SU+® £(B) dé. 
u 0 
Hence, 
e(e(d}= [ao] [ene xr eae |e 
0 
=| [7 we-maull [e-etp(@ae| 
0 0 
=L{f(O}x&{2(d}. O 
Example 3. Find the inverse Laplace transform of the function 
a 
s*(s*+a’) 


Solution. Observe that 


a) and 45 =2(sinar}. 


2 az 


Hence, by Theorem 9 
t 
e'{ 2} - t—u)sinau du 
s?(s*+a7) J ( 


_ at—sinat 


az 
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Example 4. Find the inverse Laplace transform of the function 


l 
s(s?+25+2) 
Solution. Observe that 
l l | yh. 
—=f{1} and ——— = ————— =ffe'sint}. 
S (1) s*+254+2 (st+1) +1 


Hence, by Theorem 9, 


ey —_! | = fe“sinudy 
s(s*+2s+2) 0 


= 5 [1-e7'(cosr-+sins)]. 


Remark. Let y,(¢) be the solution of the homogeneous equation ay” + by’ 
+ cy =0 which satisfies the initial conditions y,(0)=0, y,(0)=1. Then, 


H()=f()4 29 (3) 


is the particular solution of the nonhomogeneous equation ay” + by’+cy = 
f(t) which satisfies the initial conditions ¥(0)=y’(0)=0. Equation (3) is 
often much simpler to use than the variation of parameters formula de- 
rived in Section 2.4. 


EXERCISES 


Compute the convolution of each of the following pairs of functions. 


1. e%, ce" axzb 2. e@, e@ 
3. cosat, cos bt 4, sinat, sinbt, axb 
5. sinat, sinat 6. ¢, sint 


Use Theorem 9 to invert each of the following Laplace transforms. 


_! ee 9, —5__ 
s?(s?+1) (s+1)(s2+4) ~ (5241) 
] l l 
10. —~—_ 11. ————~ 12. 
s(s?+1) s2(s+1) (s?+1) 


Use Theorem 9 to find the solution y(t) of each of the following integro- 
differential equations. 


13. y(t)=4t-3 f ‘y(u)sin(t— u)du 
0 
t . 

14, y()=41—3 [ y(t—u)sinudu 
0 
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15. y(Q=sint+ ['y(t—u)cosudu, y(0)=0 
0 
16. y(t) =422— i) ‘y(uye ~~) du 
0 
t . 
17. y'(t)+2y+ f 'y(u)du=sint, y(0)=1 
0 


18. y(nat—et ['y(uje “du 

0 
19. Prove that fe(g+h)=fagtf«h. 
20. Prove that (f+g)*h=f«*(g*h). 


2.14 The method of elimination for systems 


The theory of second-order linear differential equations can also ve used 
to find the solutions of two simultaneous first-order equations of the form 


x= Bmax tony tf(0 
(1) 
‘= & =c(t)x+d(t)yt+g(t). 


The key idea is to eliminate one of the variables, say y, and then find x as 
the solution of a second-order linear differential equation. This technique 
is known as the method of elimination, and we illustrate it with the follow- 
ing two examples. 


Example 1. Find all solutions of the simultaneous equations 


x=2x+ytt 
(2) 
y=ext3ytl. 
Solution. First, we solve for 
y=x'-2x-t (3) 


from the first equation of (2). Differentiating this equation gives 
y=ux”—2x’-1l=xt+3yt+1. 
Then, substituting for y from (3) gives 
x" —2x'-1=x+3(x'-2x-t)+1 
so that 
x” —~5x’'+5x=2—3t. (4) 

Equation (4) is a second-order linear equation and its solution 1s 

x(t)=e%/?| ce [24 cye~ YS 1/2] — (+3%) ™ 
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for some constants c, and c,. Finally, plugging this expression into (3) gives 
y(t)=er/? I+V5 +5 coy 1/24 i-Vo YS ce7 V5 1/2 + > LV 


Example 2. Find the solution of the initial-value problem 
x'=3x-y, x(0)=3 


(5) 
y=xty, y(0)=0. 
Solution. From the first equation of (5), 
y=3x-x’. (6) 


Differentiating this equation gives 
y=3x'-x"=x+y. 
Then, substituting for y from (6) gives 
3x’ — x” =x+3x-x’ 
so that 
x” —4x’ +4x=0. 
This implies that 
x(t) =(c, + ct) e”! 
for some constants c,, c,, and plugging this expression into (6) gives 
y(t)=(¢,— cy + cyt) e”. 
The constants c, and c, are determined from the initial conditions 
x(0)=3=c, 
y (0) =0=c,—c¢). 
Hence c, =3, c,=3 and 
x(t)=3(1+ te, y(t) =3 te” 
is the solution of (5). 
Remark. The simultaneous equations (1) are usually referred to as a first- 


order system of equations. Systems of equations are treated fully in 
Chapters 3 and 4. 


EXERCISES 
Find all solutions of each of the following systems of equations. 


1. x’=6x—3y 2. x'=—Qx+ytt 
y=dxt+y y= —4x+3y—-1 
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3. x’=—3x+2y 4. x’=xt+yte! 
ys —x—-y y=x-y-e! 


Find the solution of each of the following initial-value problems. 


5. x’=xt+y, x(0)=2 6. x’=x—3y, x(0)=0 
y=4xt+y, y0)=3 y'=—2x4+2y, y(O)=5 
7, x'=x-y, x(0)=1 8. x’=3x-2y, x(0)=1 
y =5x—-3y, yO)=2 y'=4x-y, y(O)=5 
9. x'=4x+5y+4e'cost, x(0)=0 10. x'=3x—-—4yt+e'’, x(O)=1 
y= —2x-2y, yO=0 —_y'=x-yte,  yO=1 
Wl. x’=2x—S5y+sint, x(0)=0 12. x’=y +f, (0), x(0)=0 
y =x—2yt+tant, y(0)=0 ‘=—x+f,(t), y(0)=0 


2.15 Higher-order equations 


In this section we briefly discuss higher-order linear differential equations. 


Definition. The equation 


dq" q’—! 
Ly] = a, (1) +4, +... 4a9(t)y=0, a,(t)#0 (1) 
is called the general nth order homogeneous linear equation. The dif- 
ferential equation (1) together with the initial conditions 


V(t) =Yor Y(t) = YO-YO (tg) ="? (1’) 


is called an initial-value problem. The theory for Equation (1) is com- 
pletely analogous to the theory for the second-order linear homoge- 
neous equation which we studied in Sections 2.1 and 2.2. Therefore, we 
will state the relevant theorems without proof. Complete proofs can be 
obtained by generalizing the methods used in Sections 2.1 and 2.2, or by 
using the methods to be developed in Chapter 3. 


Theorem 10. Let y,(¢),...,y,(¢) be n independent solutions of (1); that is, no 
solution y,(t) is a linear combination of the other solutions. Then, every 
solution y(t) of (1) is of the form 

yOD=cy (+... +¢,y,(0) (2) 


for some choice of constants c,,...,c,. For this reason, we say that (2) is 
the general solution of (1). 


To find n independent solutions of (1) when the coefficients ao, a,,...,4, 
do not depend on ¢, we compute 
L[e"|=(a,r"+a,_yr"~' +... tape”. (3) 
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This implies that e” is a solution of (1) if, and only if, r is a root of the 
characteristic equation 


a,r"+a,_yr" 1+... ta,=0. (4) 


Thus, if Equation (4) has n distinct roots r,,...,7,, then the general solution 
of (1) is y(t)= ce" +... +0,e. If r=a,+i8 is a complex root of (4), 
then 

u(t)=Ref{e% }=e%' cosBt 
and 

o(t)=Im{e% } =e%'sin Bit 
are two real-valued solutions of (1). Finally, if 7, is a root of multiplicity k; 
that is, if 

a,r"+...+aj=(r- r:)‘q(r) 
where q(r,)#0, then e’”, te’’,...,t*~'e" are k independent solutions of 
(1). We prove this last assertion i in the following manner. Observe from (3) 
that 

Llet]=(r—n)*a(nye" 


if r, is a root of multiplicity k. Therefore, 


L| He™ |] = “1 xe" 


or/ =r, 

g/ 

— — ff e” 
dr e"] r=r, 
oJ , 

=a ri)'q(rje" 

=0,forl<j<k. 

Example 1. Find the general solution of the equation 

d*y 
—+y=0. 5 
Wa ty (5) 


Solution. The characteristic equation of (5) is r4+1=0. We find the roots 
of this equation by noting that 


—l=e7= eo7t = oom = elt. 
Hence, 
=— pit/4_ 7 l . 
r,=e / =cos 7 + isin 47 “Gq Ut): 
= p37i/4 3a 3m _ __1 —j] 
ro=e cos + isin 4 vy | i), 
r= 68/4 = co 27 + isin 22 = -~Lis+a, 
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and 


7e'/4= cos + isin 2 = = veal —i) 
are 4 roots of the equation r*+ 1=0. The roots r,; and r, are the complex 


conjugates of r, and r,, respectively. Thus, 


rg=e 


ert = gt/V2 [cos + isin | 
V2 V2 


and 


_ Loe t 
ert = e-t/V2 cos + isin | 
V2 V2 


are 2 complex-valued solutions of (5), and this implies that 


v2 


y(t) =e’? cos v2 


. t 
y,(t)=e/? sin —, 
2(¢) VF 


t 
V2 
_— ,-t/V2 t _ ,—-t/V2 ¢: t 
y3(t)=e—'/%? cos ——, and y,(t)=e sin —— 


are 4 real-valued solutions of (5). These solutions are clearly independent. 
Hence, the general solution of (5) is 


V2 t ot 
y(t)=e'/ ascos +b, sin | 
) V7 OV 
t  ¢ 

+ b,sin —— |. 
v2” | 


Example 2. Find the general solution of the equation 


+e? a cos —— 


—— —3—~ 43-=— = =0, (6) 


Solution. The characteristic equation of (6) is 
O=74-37r°4+37-—r= r(r?—3r2+3r- 1) 
=r(r—1)’. 


Its roots are r, =0 and r,=1, with r,=1 a root of multiplicity three. Hence, 
the general solution of (6) is 


y(t)=e, +(e, + cyt + cyt?)e’. 
The theory for the nonhomogeneous equation 
d"y 
Ll y=) 55 +... +ag()y=F (1), 4,(0) #0 (7) 
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is also completely analogous to the theory for the second-order nonhomo- 
geneous equation. The following results are the analogs of Lemma | and 
Theorem 5 of Section 2.3. 


Lemma 1. The difference of any two solutions of the nonhomogeneous equa- 
tion (7) is a solution of the homogeneous equation (1). 


Theorem 11. Let W(t) be a particular solution of the nonhomogeneous equa- 
tion (7), and let y,(t),...,¥,(t) be n independent solutions of the homoge- 
neous equation (1). Then, every solution y(t) of (7) is of the form 


VDP +e yi (+... +6 In() 


for some choice of constants C,, C,...,Cy- 


The method of judicious guessing also applies to the nth-order equation 


a, a + LF agy=[bot btt... +b t* le”. (8) 


It is easily verified that Equation (8) has a particular solution y(¢) of the 
form 


W(t)h=[cotet+... tot" lem 

if e“ is not a solution of the homogeneous equation, and 
W(th=t[egtett...+¢,t* lem 

if /—'e* is a solution of the homogeneous equation, but He is not. 


Example 3. Find a particular solution (7) of the equation 
dy d*y dy 
Lly]=—7 5 +375 +3 tye! (9) 
Solution. The characteristic equation 


P43r24+3r+1=(r4+1)y 


has r= — 1 as a triple root. Hence, e‘ is not a solution of the homogeneous 
equation, and Equation (9) has a particular solution y(t) of the form 
W(t) = Ae’. 


Computing L[y](1) =8Ae’‘, we see that A = +. Consequently, y(t)= e' is a 
particular solution of (9). 


There is also a variation of parameters formula for the nonhomoge- 
neous equation (7). Let v(z) be the solution of the homogeneous equation 
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(1) which satisfies the initial conditions v(t,)=0, v’(tp) =0,...,0%~ (to) =0, 


v(t.) =1. Then, 
v(t)= [oe PC) F(syas 


is a particular solution of the nonhomogeneous equation (7). We will prove 
this assertion in Section 3.12. (This can also be proven using the method of 
Laplace transforms; see Section 2.13.) 


EXERCISES 
Find the general solution of each of the following equations. 
1, yy" —2y"”—y'+2y=0 2. y’— by” +5y’+ lL2y =0 
3. ya) — Sy" + 6y” +4y’—8y =0 4. y"—y"+y’—y=0 
Solve each of the following initial-value problems. 
5. y+ 4y'" + 14y” —20y'+25y =0; y(0)=y'(0)=y"(0)=0, y’”(0) =0 
6. y™—y=0; yO)=1, O=y"O=9, yO = -1 
7. y—2y4y"=0; yO)=y'O=v"O=y"O=0, yO) = - 1 
8. Given that y,(f)=e' cost is a solution of 
yO— dy" +y" +2y’—2y =0, (*) 


find the general solution of (*). Hint: Use this information to find the roots of 
the characteristic equation of (*). 


Find a particular solution of each of the following equations. 


9. y’’+y’=tant 10. y—y=g(2) 

11. y+ y=2/(2) 12. y'"+y’=207+4sint 

13. y’” —4y’=t+cost+2e~~ 14. y—y=t+sint 

15. yM+42y"+y=Psint 16. y+ y"=2 

yl +y"+y'+y=tte' 18. y™+4y'" +6y"” +4y’+y=Pre7! 


Hint for (18): Make the substitution y = e~ ‘v and solve for v. Otherwise, it will 
take an awfully long time to do this problem. 
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Systems of differential equations 


3.1 Algebraic properties of solutions of linear systems 


In this chapter we will consider simultaneous first-order differential equa- 
tions in several variables, that is, equations of the form 


dx, 

dt. =f (4,1, 665 Xp)s 

ax, 

GT btn): (1) 
dx 


7s =f (t,X1,...,X,): 
A solution of (1) is n functions x,(2),...,x,(¢) such that dx,(t)/dt= 
F(t, x,(2),...,x, (0), 7=1,2,...,n. For example, x,(t)=¢ and x,(t)=?? is a 
solution of the simultaneous first-order differential equations 

dx, 4 dx 5 

de OS gy 
since dx,(t)/dt=1 and dx,(t)/dt=2t=2x,(t). 

In addition to Equation (1), we will often impose initial conditions on 

the functions x,(¢),...,x,(¢). These will be of the form 


X4 (to) = Xp, Xp (to) = XQ, -++s Xp (to) = Xp (1’) 
Equation (1), together with the initial conditions (1)’, is referred to as an 
initial-value problem. A solution of this initial-value problem is n functions 
x,(t),.-.,X, (4) which satisfy (1) and the initial conditions 
X (to) = X75---5%n (lo) = Xn 
For example, x,(¢)=e' and x,(t)=1+ e*'/2 is a solution of the initial- 
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value problem 


dx, 

ne x,(0)=1, 
dx, 3 
> el x,(0)=>5, 


since dx,(t)/dt=e'=x,(t), dx,(t)/dt =e” = x(t), x,(0)=1 and x,(0)=3. 

Equation (1) is usually referred to as a system of n first-order differen- 
tial equations. Equations of this type arise quite often in biological and 
physical applications and frequently describe very complicated systems 
since the rate of change of the variable x, depends not only on ¢ and x,, but 
on the value of all the other variables as well. One particular example is the 
blood glucose model we studied in Section 2.7. In this model, the rates of 
change of g and h (respectively, the deviations of the blood glucose and net 
hormonal concentrations from their optimal values) are given by the equa- 
tions ; 

= = B= — myh+ mgs. 

This is a system of two first-order equations for the functions g(t) and h(t). 

First-order systems of differential equations also arise from higher-order 
equations for a single variable y(t). Every nth-order differential equation 
for the single variable y can be converted into a system of n first-order 
equations for the variables 


—m,g—mh+J(t), 


d"-'y 
at"! ° 


dy 
n(=y, (=D (= 
Examples 1 and 2 illustrate how this works. 
Example 1. Convert the differential equation 
d"y d"~y 
a,(t) ar +a, () +... +a, y =0 


into a system of n first-order equations. 
Solution. Let x,(t)=y, x(t)= dy /dt,..., and x,(t)=d"'y/dt"—'. Then, 


dx, dx ax, —| 
a Fr Sa ie 
and 
aX, __ A, (tf) X_q + Ay 7 (1) Xp H.-F AQ] 
at a,(t) 


Example 2. Convert the initial-value problem 
d? ? 

5 +(Z) +3=65 yO=1 yYO=0, y"@=0 
into an initial-value problem for the variables y, dy /dt, and d’y / dt?. 


265 


3 Systems of differential equations 


Solution. Set x,(t)=y, x,(t)=dy/dt, and x,(t)= dy /dt*. Then, 
dx, dx, dx; og 
aX a aac am? — x3—3x). 

Moreover, the functions x,, x2, and x, satisfy the initial conditions x ,(0)= 

1, x,(0)=0, and x,(0)=0. 


If each of the functions f,,...,f, in (1) is a linear function of the depen- 
dent variables x,,...,x,, then the system of equations is said to be linear. 
The most general system of n first-order linear equations has the form 


ax 
= =a,,(t)x, +... +a,,(t)x, +2; (t) 


. (2) 
dx,, 
dt = nl (t)x, + “ee + Ann (t) Xp +g,,(t). 
If each of the functions g,,...,g, is identically zero, then the system (2) is 
said to be homogeneous; otherwise it is nonhomogeneous. In this chapter, 


we only consider the case where the coefficients a4, do not depend on 1. 
Now, even the homogeneous linear system with constant coefficients 


dx, 
“dt. =A4),x,+ eee + a,,X; 
, (3) 
dx, 
dt. = mee eee + a,),Xp 


is quite cumbersome to handle. This is especially true if n is large. There- 
fore, we seek to write these equations in as concise a manner as possible. 
To this end we introduce the concepts of vectors and matrices. 


Definition. A vector 


is a shorthand notation for the sequence of numbers x,,...,x,. The 
numbers x,,...,x, are called the components of x. If x,=x,(0),..., and 
x, =X,(t), then 


x, (t) 
x(t) = 
X,(t) 
is called a vector-valued function. iis derivative dx(t)/dt is the vector- 
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valued function 


dx, (t) 
at 
dx,,(t) 
at 
Definition. A matrix 
ay, 412 Qin 
ay, 42 Arp 
A= . . 
Qn Qn2 Qmnn 


is a shorthand notation for the array of numbers a, arranged in m rows 
and n columns. The element lying in the ith row and jth column is de- 
noted by a,, the first subscript identifying its row and the second sub- 


script identifying its column. A is said to be a square matrix if m=n. 
Next, we define the product of a matrix A with a vector x. 


Definition. Let A be an n Xn matrix with elements a, and let x be a vector 
with components x,,...,x,. We define the product of A with x, denoted 
by Ax, as the vector whose ith component is 


Aj\X, + AoX_+...+a,,x i=1,2,...,n. 


in“ ~n? 


In other words, the ith component of Ax is the sum of the product of 
corresponding terms of the ith row of A with the vector x. Thus, 


Qyy Gy2 eee Ay | | XY 
Q>] a> eee a>, X5 
Ax= . . 
Qnty An Ann Xn 
AyjX, FAX 7+... $a,,X, 
xX, FAX +... Hay, X, 
A,X} + Ay X54 $4, 
For example, 
1 2 4)[3 3+4+4 1] 
—-1 O 6/);2|=| —34+0+6|=| 3). 
I 1 Ij{l 34+2+1 6 
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Finally, we observe that the left-hand sides of (3) are the components of 
the vector dx/dt, while the right-hand sides of (3) are the components 


of the vector Ax. Hence, we can write (3) in the concise form 


Qi Q12 eee Qin 
x} 
dx a>, A> eee a>, 
x= — =Ax, where x=]: and A= 
at : 
x, 
Qny Ang ee Ann 


Moreover, if x,(¢),...,x,,(¢) satisfy the initial conditions 
X1 (to) =X}, 00+) Xn (to) = Xp» 
then x(t) satisfies the initial-value problem 


0 
| xy 


X=Ax,  x(t))=x°, where x°= 


Zo °° 


For example, the system of equations 


dx, 

7 =3x,—7x,+9x, 
ax, 

dt = 15x, + X5— X3 
dx, 

Tr =7x,+6x, 


can be written in the concise form 


7 0 6 


and the initial-value problem 


dx, 

dx, 

at. =3x,- X35 x,(0) =0 
dx, 

77 +7x3, x,(0)= —1 


can be written in the concise form 
1 -I] l l 
x=]0 3. -—I1]x, x(O)=| Of}. 
l 0 7 
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Now that we have succeeded in writing (3) in the more manageable 
form (4), we can tackle the problem of finding all of its solutions. Since 
these equations are linear, we will try and play the same game that we 
played, with so much success, with the second-order linear homogeneous 
equation. To wit, we will show that a constant times a solution and the 
sum of two solutions are again solutions of (4). Then, we will try and show 
that we can find every solution of (4) by taking all linear combinations of a 
finite number of solutions. Of course, we must first define what we mean 
by a constant times x and the sum of x and y if x and y are vectors with n 
components. 


Definition. Let c be a number and x a vector with n components x,,...,x,. 
We define cx to be the vector whose components are cx),...,CX,, that is 
xy CX) 
X5 CX 
cCX=C =|. 
x Cx 


For example, if 


3 3 6 
c=2 and x=/1|, then 2x=2/1/=]| 2]. 
7 7 14 


This process of multiplying a vector x by a number c 1s called scalar 
multiplication. 


Definition. Let x and y be vectors with components x,,...,x, and yj,...,¥, 
respectively. We define x+y to be the vector whose components are 
X,; +) ,--->X, +Y,, that is 


x} yy xX;+y, 
X4 y2 Xt V2 
X+ty=|. [+]. /= ; 
Xn} | Xn) | Xt, 
For example, if 
] —] 
_ 16 _| —6 
x= 5 and y 7 |? 
2 9 
then 
l —] 0 
6 —6 0 
+y=|O}4 = 
ery | 3 71 110 
2 9 11 


This process of adding two vectors together is called vector addition. 
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Having defined the processes of scalar multiplication and vector addi- 
tion, we can now state the following theorem. 


Theorem 1. Let x(t) and y(t) be two solutions of (4). Then (a) cx(t) is a solu- 
tion, for any constant c, and (b) x(t)+y(t) is again a solution. 


Theorem | can be proven quite easily with the aid of the following 
lemma. 


Lemma. Let A be an nXn matrix. For any vectors x and y and constant c, 
(a) A(cx) = cAx and (b) A(x+y)=Ax+ Ay. 


PROOF OF LEMMA. 

(a) We prove that two vectors are equal by showing that they have the 
same components. To this end, observe that the ith component of the vec- 
tor cAx 1s 


CO) X + COX + 10. H+ CAigXp_ = C( GX, +... + in Xn)s 
and the ith component of the vector A(cx) 1s 
Giy (cx,) + A; (CX) +... + a;,,(CX,,) = c(a, x, +... + AinXn)> 


Hence A(cx) = cAx. 
(b) The ith component of the vector A(x+y) is 


Ay (Xp AY) +--+ ig (Xp FY n) = (Gi Xt oF in Xn) + (GVH 0 + in Yn): 
But this is also the ith component of the vector Ax + Ay since the ith com- 
ponent of Ax is a,,x,+...+4a,,x, and the ith component of Ay is a, y, 


inven 


+...+4a,,y,. Hence A(x-+y)= Ax + Ay. CJ 


PROOF OF THEOREM I. 
(a). If x(t) is a solution of (4), then 


a 


Sex x(t)=c = cAx(t) = A(cx(f)). 


Hence, cx(/) is also a solution of (4). 
(b). If x(t) and y(Z) are solutions of (4) then 


d d 
4 (x()+y(Q)= Ao? + 2 = ancy +ay(sy=Alx( +y(9). 
Hence, x(¢) + y(?) is also a solution of (4). O 


An immediate corollary of Theorem 1 is that any linear combination of 
solutions of (4) is again a solution of (4). That is to say, if x!(1),...,x/(Z) are 
j solutions of (4), then c,x'()+...+ c;x/ (t) is again a solution for any 
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choice of constants c,,c,...,¢;. For example, consider the system of equa- 
tions 


ax, ax, dx 0 1 xy 
Be Gn 8 GAC Gols x-(2) 


This system of equations was derived from the second-order scalar equa- 
tion (d’y / dt?) + 4y =0 by setting x, =y and x,=dy/dt. Since y,(t)=cos2r 
and y,(f)=sin2¢ are two solutions of the scalar equation, we know that 


x,(t) cos2t sin 2¢ 
x(t) = = Cy ; +, 
x(t) —2sin2¢ 2cos2t 
c,cos2t+c,sin2¢ 
—2c,sin2t+2c,cos2t 


is a solution of (6) for any choice of constants c, and cp. 

The next step in our game plan is to show that every solution of (4) can 
be expressed as a linear combination of finitely many solutions. Equiv- 
alently, we seek to determine how many solutions we must find before we 
can generate all the solutions of (4). There is a branch of mathematics 
known as linear algebra, which addresses itself to exactly this question, and 
it is to this area that we now turn our attention. 


EXERCISES 


In each of Exercises 1-3 convert the given differential equation for the sin- 
gle variable y into a system of first-order equations. 


d? 2 a d‘ d2 
y (F) -9 2. o” +cosy =e! 3. or 4 aI 
dt? 
4. Convert the pair of second-order equations 


d*y dz dz dy 


into a system of 4 first-order equations for the variables 


x1=y, x2=y’, x3=2, and x4=2Z’. 


5. (a) Let y(t) be a solution of the equation y” + y’+y =0. Show that 


y(t) 
y(t) 


is a solution of the system of equations 


_( 0 1 
x (_‘ _i)x 


x= ( 
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(b) Let 


_ (x 
= ee 


be a solution of the system of equations 
._{ O ] 
w=(_9 _t 
Show that y= x,(t) is a solution of the equation y” +y’+y=0. 


In each of Exercises 6-9, write the given system of differential equations 
and initial values in the form x= Ax, X(f9) =x°. 


6. X,=3x,-—7Xp, x,(0)=1 7. X,=5x,+5x), x,(3)=0 
X2=4x, x,(0)= 1 Xy=—X,+7xX2, x,(3)=6 
8. Xp =X, +xX,— x3, x (0) =0 9. x,= — X43, x,(— 1) =2 
X_=3x)— X_ + 4x3, x,(0)= 1 X2= X1> x,(-— 1)=3 
X= — x1 — Xy, x,(0)= —1 Xy=—Xy, xx(—1)=4 
10. Let 


= 


Compute x+y and 3x—2y. 
11. Let 


Compute Ax if 


l 0 0 
@x-(0) x-(1}, ox-(0} 
0 0 | 


12. Let A be any Xn matrix and let e’ be the vector whose jth component is 1 
and whose remaining components are zero. Verify that the vector Ae’ is the /th 
column of A. 


13. Let 


> 

I 
mm 
| | 
wet Kee 


Compute Ax if 


l l l l 
wa-(2}, x-{-1}, ox-(1} @x-(0} 
4 —] l l 


14. Let A be a 3X3 matrix with the property that 


(iG) =H) 
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Compute 
Hint: Write 


as a linear combination of 


15. Let A be a 2X2 matrix with the property that 


a(i)=(2) and a(_i)=(6) 


Find A. Hint: The easy way is to use Exercise 12. 


3.2 Vector spaces 


In the previous section we defined, in a natural manner, a process of 
adding two vectors x and y together to form a new vector z=x+y, and a 
process of multiplying a vector x by a scalar c to form a new vector u= cx. 
The former process was called vector addition and the latter process was 
called scalar multiplication. Our study of linear algebra begins with the 
more general premise that we have a set V of elements x,y,z,... and that 
we have one process that combines two elements x and y of V to form a 
third element z in V and a second process that combines a number c and 
an element x in V to form a new element u in V. We will denote the first 
process by addition; that is, we will write z=x-+y, and the second process 
by scalar multiplication; that is, we will write u=cx, if they satisfy the us- 
ual axioms of addition and multiplication. These axioms are: 


(i) x+y=y+x (commutative law) 
(li) x+(y+z)=(x+y)+z (associative law) 
(iii) There is a unique element in V, called the zero element, and denoted 
by 0, having the property that x +0=x< for all x in V. 
(iv) For each element x in V there is a unique element, denoted by —x 
and called minus x, such that x +(—x)=0. 
(v) 1-x=x for all x in V. 
(vi) (ab)x = a(bx) for any numbers a,b and any element x in V. 
(vi) a(x+y)=ax+ay 
(vill) (a+ b)x=ax+t bx. 


A set V, together with processes addition and multiplication satisfying 
(i}-(viii) is said to be a vector space and its elements are called vectors. The 
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numbers a,b will usually be real numbers, except in certain special cases 
where they will be complex numbers. 


Remark 1. Implicit in axioms (i)-(viii) is the fact that if x and y are in V, 
then the linear combination ax+ by is again in V for any choice of con- 
stants a and b. 


Remark 2. In the previous section we defined a vector x as a sequence of n 
numbers. In the more general context of this section, a quantity x is a vec- 
tor by dint of its being in a vector space. That is to say, a quantity x is a 
vector if it belongs to a set of elements V which is equipped with two 
processes (addition and scalar multiplication) which satisfy (i)}{viii). As we 
shall see in Example 3 below, the set of all sequences 


xX 
X4 


of n real numbers is a vector space (with the usual operations of vector 
addition and scalar multiplication defined in Section 3.1). Thus, our two 
definitions are consistent. 


Example 1. Let V be the set of all functions x(t) which satisfy the differen- 
tial equation 


2 
ax _ 39 (1) 


with the sum of two functions and the product of a function by a number 
being defined in the usual manner. That is to say, 


CAtAIMO=AOth@ 


and 
(cf )(t) = cf (¢). 


It is trivial to verify that V is a vector space. Observe first that if x! and x? 
are in V, then every linear combination c,x'+c,x? is in V, since the dif- 
ferential equation (1) is linear. Moreover, axioms (i), (ii), and (v)(viii) are 
automatically satisfied since all we are doing at any time ¢ in function 
addition and multiplication of a function by a number is adding or multi- 
plying two numbers together. The zero vector in V is the function whose 
value at any time ¢ is zero; this function is in V since x(#)=0 is a solution 
of (1). Finally, the negative of any function in V is again in V, since the 
negative of any solution of (1) is again a solution of (1). 
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Example 2. Let V be the set of all solutions x(t) of the differential equa- 
tion (d2x /dt*)—6x?=0, with the sum of two functions and the product of 
a function by a number being defined in the usual manner. V is not a vec- 
tor space since the sum of any two solutions, while being defined, is not 
necessarily in V. Similarly, the product of a solution by a constant is not 
necessarily in V. For example, the function x(t)=1/t* is in V since it 
satisfies the differential equation, but the function 2x(f)=2/t? is not in V 
since it does not satisfy the differential equation. 


Example 3. Let V be the set of all sequences 


Xx} 


of n real numbers. Define x+y and cx as the vector addition and scalar 
multiplication defined in Section 3.1. It is trivial to verify that V is a vector 
space under these operations. The zero vector is the sequence 


0 
0 


and the vector —x is the vector 


This space is usually called n dimensional Euclidean space and is denoted 
by R”. 


Example 4. Let V be the set of all sequences 


Ny 


of n complex numbers x,,...,x,- Define x+y and cx, for any complex 
number c, as the vector addition and scalar multiplication defined in Sec- 
tion 3.1. Again, it is trivial to verify that V is a vector space under these 
operations. This space is usually called complex n dimensional space and is 
denoted by C”. 
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Example 5. Let V be the set of all nxn matrices A. Define the sum of two 
matrices A and B to be the matrix obtained by adding together corre- 
sponding elements of A and B, and define the matrix cA to be the matrix 
obtained by multiplying every element of A by c. In other words, 


Qi) Ayn 2+ Ay, bi, Dy wee Dy 
a>) a5 eee Qa, 4 bs, by, eee b>, 
a, 1 an2 Ann b, 1 b> ban 
Ay,+5,,; a+b a,,+,, 
Ay, + by, ayt by Ay, + bp, 
Qn + b, an2 + bya ann + Ban 
and 
Qi, ayn «ss Ay, CA;,; Cay...» = CA, 
Ay, An, «+» Ay, CAy, Cay, ... CA», 
Cc = 
Any Ann ee nn CA,, CAyy see CO, 


Axioms (i), (ii), and (v)(vill) are automatically satisfied since all we are 
doing in adding two matrices together or multiplying a matrix by a number 
is adding or multiplying two numbers together. The zero vector, or the 
matrix 0, is the matrix whose every element is the number zero, and the 
negative of any matrix A is the matrix 


— ay ees —ay, 


—_ Qn} oe ~~ ann 
Hence V is a vector space under these operations of matrix addition and 
scalar multiplication. 


Example 6. We now present an example of a set of elements which comes 
close to being a vector space, but which doesn’t quite make it. The purpose 
of this example is to show that the elements of V can be just about any- 
thing, and the operation of addition can be a rather strange process. Let V 
be the set consisting of three animals, a cat, a dog, and a mouse. Whenever 
any two of these animals meet, one eats up the other and changes into a 
different animal. The rules of eating are as follows. 
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(1) If a dog meets a cat, then the dog eats up the cat and changes into a 
mouse. 

(2) If a dog meets another dog, then one dog eats up the other and 
changes into a cat. 

(3) If a dog meets a mouse, then the dog eats up the mouse and remains 
unchanged. 

(4) If a cat meets another cat, then one cat eats up the other and changes 
into a dog. 

(5) If a cat meets a mouse, then the cat eats up the mouse and remains un- 
changed. 

(6) If a mouse meets another mouse, then one mouse eats up the other and 
remains unchanged. 


Clearly, “eating” 1s a process which combines two elements of V to form a 
third element in V. If we call this eating process addition, and denote it by 
+, then rules 1-6 can be written concisely in the form 


1. D+C=M 2. D+ D=C 3. D+ M=D 
4.C+C=D 5. C+M=C 6. M+M=M. 


This operation of eating satisfies all the axioms of addition. To see this, 
note that axiom (i) is satisfied since the eating formulae do not depend on 
the order of the two animals involved. This is to say, D+ C=C+D, etc. 
Moreover, the result of any addition is again an animal in V. This would 
not be the case, for example, if a dog ate up a cat and changed into a 
hippopotamus. The associative law (ii) is also satisfied, but it has to be 
verified explicitly. For example, suppose that we have an encounter be- 
tween two cats and a dog. It is not obvious, a priori, that it does not make 
a difference whether the two cats meet first and their resultant meets the 
dog or whether one cat meets the dog and their resultant meets the other 
cat. To check that this is so we compute 


(C+C)+D=D+D=C 
and 
(C+D)+C=M+C#=C. 


In a similar manner we can show that the result of any encounter between 
three animals is independent of the order in which they meet. Next, ob- 
serve that the zero element in V is the mouse, since every animal is un- 
changed after eating a mouse. Finally, “minus a dog” is a cat (since D+ C 
= M), “minus a cat” is a dog and “minus a mouse” is a mouse. However, 
V is not a vector space since there is no operation of scalar multiplication 
defined. Moreover, it is clearly impossible to define the quantities aC and 
aD, for all real numbers a, so as to satisfy axioms (v)—(viii). 
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Example 7. Let V be the set of all vector-valued solutions 


x,(t) 
x(t) = 
x,(t) 
of the vector differential equation 
Ay, ++ yy 
x=Ax, A=|: : |. (2) 
a a 


V is a vector space under the usual operations of vector addition and 
scalar multiplication. To wit, observe that axioms (i), (11), and (v)—(vii1) are 
automatically satisfied. Hence, we need only verify that 


(a) The sum of any two solutions of (2) is again a solution. 
(b) A constant times a solution of (2) is again a solution. 
(c) The vector-valued function 


x,(¢) 0 
x()=] = |=]: 
x,(t)} 0 
is a solution of (2) (axiom (iii). 
(d) The negative of any solution of (2) is again a solution (axiom (iv)). 


Now (a) and (b) are exactly Theorem 1 of the previous section, while (d) is 
a special case of (b). To verify (c) we observe that 


0 0 0 0 
0 0 0 0 


Hence the vector-valued function x(t)=0 is always a solution of the dif- 
ferential equation (2). 


EXERCISES 


In each of Problems 1—6, determine whether the given set of elements 


form a vector space under the properties of vector addition and scalar mul- 
tiplication defined in Section 3.1. 
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. The set of all elements x= 


. The set of all elements x= 


. The set of all elements x= 


. The set of elements x= Je a 


. The set of all elements : 


3.3. Dimension of a vector space 


where 3x, —2x,=0 


where x7+ x3+x3=1 


where x, + x,+x3= 1 


where x,;+x,+x3;=0 


l 
for all real numbers a and b 


b 


. The set of all elements x= “| where 


X, +X + x;,=0, X1— X_+2x3=0, 3x,;— xX, +5x,=0 


In each of Problems 7-11 determine whether the given set of functions 
form a vector space under the usual operations of function addition and 
multiplication of a function by a constant. 


7. 


The set of all polynomials of degree <4 


. The set of all differentiable functions 

. The set of all differentiable functions whose derivative at ¢=1 1s three 

. The set of all solutions of the differential equation y” + y =cost 

. The set of all functions y(t) which have period 27, that is y(t+27)=y(Z) 


. Show that the set of all vector-valued solutions 


_ [x1 @) 
=(20) 


of the system of differential equations 


is not a vector space. 


3.3 Dimension of a vector space 


Let V be the set of all solutions y(t) of the second-order linear homoge- 
neous equation (d*y / dt”) + p(t)(dy / dt) + q(t)y =0. Recall that every solu- 
tion y(t) can be expressed as a linear combination of any two linearly inde- 
pendent solutions. Thus, if we knew two “independent” functions y'() and 
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y(t) in V, then we could find every function in V by taking all linear com- 
binations c, y'\(t)+c, y*(t) of y' and y?. We would like to derive a similar 
property for solutions of the equation x=Ax. To this end, we define the 
notion of a finite set of vectors generating the whole space, and the notion 
of independence of vectors in an arbitrary vector space V. 


Definition. A set of vectors x',x’,...,x” is said to span V if the set of all 
linear combinations c,x'!+c,x?+...+c,x” exhausts V. That is to say, 
the vectors x!,x’,...,x” span V if every element of V can be expressed as 
a linear combination of x!,x?,...,x”. 


Example 1. Let V be the set of all solutions of the differential equation 
(dx /dt*?)— x=0. Let x! be the function whose value at any time ¢ is e’ 
and let x? be the function whose value at any time ¢ is e~‘. The functions 
x! and x? are in V since they satisfy the differential equation. Moreover, 
these functions also span V since every solution x(t) of the differential 
equation can be written in the form 


x(t)=c,e'+c,e7' 
so that 


x=¢,x!4+ cx’. 


Example 2. Let V=R" and let e’ denote the vector with a 1 in the jth place 
and zeros everywhere else, that is, 


I 0 0 
0 I 0 
el= 0 e= 0 es 
0 
0 0 


The set of vectors e',e’,...,e” span R” since any vector 


x} 
X4 
x=|. 
X, 
can be written in the form 
Xj 0 0 
0 *2 1 2 
X=}. [+]. [t...F] . | =xetx e+... +x,e". 
0 0 Xn 
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Definition. A set of vectors x',x’,...,x” in V is said to be Jinearly dependent 
if one of these vectors is a linear combination of the others. A very pre- 
cise mathematical way of saying this is as follows. A set of vectors 
x',x’,...,x” is said to be linearly dependent if there exist constants 
C15C25-++5C,, Not all zero such that 


cx! +ox?+...+¢,x"=0. 


These two definitions are equivalent, for if x’ is a linear combination of 
x) x xt! lx”, that is 


X= Cx! +... 46x +64 xt +... +0,x", 


then the linear combination 
] J-l_ew J+) n 
CyX +... F6_ 4X XC 4)X ° +... +C,X 


equals zero and not all the constants are zero. Conversely, if c,x! + cx? 
+... +c¢,x"=0 and c; #0 for some j, then we can divide by G and solve for 
x/ as a linear combination of x!,...,x/~',x/t!,...,x”. For example, if c, #0 
then we can divide by c, to obtain that 


C; C; coe C, 


Definition. If the vectors x',x?,...,x” are not linearly dependent, that is, 
none of these vectors can be expressed as a linear combination of the 
others, then they are said to be linearly independent. The precise 
mathematical way of saying this is that the vectors x',x’,...,x” are lin- 
early independent if the equation 


Cx! +cx*+...+¢,x"=0 
implies, of necessity, that all the constants c,,c.,...,c, are zero. 
In order to determine whether a set of vectors x!,x’,...,x” is linearly de- 
pendent or linearly independent, we write down the equation c,x! + cx’ 
+...+c,x"=0 and see what this implies about the constants c),C),...,C,. 
If all these constants must be zero, then x!,x’,...,x” are linearly indepen- 


dent. On the other hand, if not all the constants c,,c,,...,c, must be zero, 
then x!,x’,...,x” are linearly dependent. 


Example 3. Let V=R? and let x', x”, and x? be the vectors 


l ] 3 
x'=|-]], x*=/]2] and x=|0O!. 
l 3 5 


281 


3 Systems of differential equations 


To determine whether these vectors are linearly dependent or linearly inde- 
pendent, we write down the equation c,x' + c,x?+¢,x’=0, that is 


] ] 3 0 
Cy} ~1L]| +e.) 2) 4+¢,/0)=/0]. 
] 3 5 0 


The left-hand side of this equation is the vector 


C,) +c, +3c; 
c,+3c,+5c; 


Hence the constants c,, c,, and c,; must satisfy the equations 


C) +c, +3c,=0, (i) 
—c¢,+2c,=0, (ii) 
C,+3c,+5c,=0. (iii) 


Equation (ii) says that c, =2c,. Substituting this into Equations (i) and (iii) 
gives 
3¢,+3c,=0 and 5c,+5c,=0. 


These equations have infinitely many solutions c,,c, since they both reduce 


to the single equation c,+c,=0. One solution, in particular, is c,=—1, 
c,;=1. Then, from Equation (11), c, = —2. Hence, 
l l 3 0 
—2} -—1/-—|21+/0]=]0 
| 3 5 0 


and x!, x” and x? are linearly dependent vectors in R?. 


Example 4. Let V=R’ and let e',e,...,e” be the vectors 


| 0 0 
0 l 0 
e=|9), =| 90]. era]: 
0 
0 0 ] 


To determine whether e',e’,...,e” are linearly dependent or linearly inde- 
pendent, we write down the equation cje'+... +c,e”=0, that is 


l 0 0 0 
0 l 0 0 
Cy 0 + C, 0 +...4¢6,| . [= 0 
: : 0 ; 
0 0 I 0 
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The left-hand side of this equation is the vector 


Hence c, =0,c,=0,..., and c,=0. Consequently, e',e’,...,e” are linearly 
independent vectors in R”. 


Definition. The dimension of a vector space V, denoted by dimV, is the 
fewest number of linearly independent vectors which span V. V is said 
to be a finite dimensional space if its dimension is finite. On the other 
hand, V is said to be an infinite dimensional space if no set of finitely 
many elements span V. 


The dimension of a space V can be characterized as the fewest number 
of elements that we have to find in order to know all the elements of V. In 
this sense, the definition of dimension captures our intuitive feeling. How- 
ever, it is extremely difficult to compute the dimension of a space V from 
this definition alone. For example, let V=R”. We have shown in Examples 
2 and 4 that the vectors e',e’*,...,e” are linearly independent and span V. 
Moreover, it seems intuitively obvious to us that we cannot generate R” 
from fewer than n vectors. Thus, the dimension of R” should be n. But how 
can we prove this rigorously? To wit, how can we prove that it is impossi- 
ble to find a set of (n— 1) linearly independent vectors that span R"? Thus, 
our definition of dimension is not, as yet, a very useful one. However, it 
will become extremely useful after we prove the following theorem. 


Theorem 2. Jf n linearly independent vectors span V, then dimV=n. 


We will need two lemmas to prove Theorem 2. The first lemma con- 
cerns itself with the solutions of simultaneous linear equations and can be 
motivated as follows. Suppose that we are interested in determining n un- 
known numbers x,, x ,...,x, uniquely. It seems pretty reasonable that we 
should be given n equations satisfied by these unknowns. If we are given 
too few equations then there may be many different solutions, that is, 
many different sets of values for x,,x>,...,x, which satisfy the given equa- 
tions. Lemma | proves this in the special case that we have m homoge- 
neous linear equations for n>m unknowns. 


Lemma 1. A set of m homogeneous linear equations for n unknowns 
X1,Xz,...,X, always admits a nontrivial solution if m<n. That is to say, 
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the set of m equations in n unknowns 
AX, + AyyX, +... + a,,x, =0 


Ay) X1 + Ay Xy +... + ay, x, =0 


(1) 
An {XH AygXy +... $4,,,X, =0 
always has a solution X,,X>,...,X,, other than x, =... =x, =0, if m<n. 


Remark. Notice that x, =0,x,=0,...,x, =0 is certainly one solution of the 
system of equations (1). Thus, Lemma | is telling us that these equations 
have more than one solution. 


PROOF OF LEMMA |. We will prove Lemma 1 by induction on m. To this 
end, observe that the lemma is certainly true if m=1, for in this case we 
have a single equation of the form a,,x,+4).x,+...+4,,x, =0, with 
n2>2. We can find a nontrivial solution of this equation, if a,, =0, by 
taking x, =1,x,=0,...,x, =0. We can find a nontrivial solution of this 
equation, if a,, #0, by taking x, =1,...,x, =1 and x, =—(apy+...+ 
Q1,)/ a}. 

For the next step in our induction proof, we assume that Lemma 1 is 
true for some integer m=k and show that this implies that Lemma | 1s 
true for m=k+1, and k+1< xn. To this end, consider the k+1 equations 


for the n unknowns x,,x,,...,X, 


AyjX, + AyyxX, +...+ a,x, =0 
Ay,X, + AX. +...+ a,x, =0 
(2) 
Me 11% 1 + Ay 1,2%2F FO 1 2X =O 
with kK+1<n. If ay),@),...,4,4, are all zero, then x,=1,x,=0,...,x, =0 


is clearly a non-trivial solution. Hence, we may assume that at least one of 
these coefficients is not zero. Without any loss of generality, we may 
assume that a,,+0, for otherwise we can take the equation with the non- 
zero coefficient of x, and relabel it as the first equation. Then 


Substituting this value of x, in the second through the (k + 1)st equations, 
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we obtain the equivalent equations 


QyyX,+ AyyX, + Ax, +...+ a,x, =0 
Dy9X_ + box, +...+ b,x, =0 

(3) 
byoXy + Dx, +...+ b,x, =O 


Dy 4 1,2%2 + Dy 4 1,3%3 + .«-. + by, 1nX%p =O 


where b,, =a, — a;,4,;/a,,. Now, the last k equations of (3) are k homoge- 
neous linear equations for the (n—1) unknowns x,,...,x,. Moreover, k is 
less than n—1 since k +1 is less than n. Hence, by the induction hypothe- 
sis, these equations have a nontrivial solution x,,...,x,. Once x5,...,x, are 


known, we have as before x, = —(a,)x, +... +4,,x,)/a,, from the first 
equation of (3). This establishes Lemma | for m=k +1, and therefore for 
all m, by induction. O 


If a vector space V has dimension m, then it has m linearly independent 
vectors x!,...,x”” and every vector in the space can be written as a linear 
combination of the m vectors x',x’,...,x’”. It seems intuitively obvious to 
us in this case that there cannot be more than m linear independent vectors 
in V. This is the content of Lemma 2. 


Lemma 2. Jn an m dimensional space, any set of n>m vectors must be lin- 
early dependent. In other words, the maximum number of linearly indepen- 
dent vectors in a finite dimensional space is the dimension of the space. 


Proor. Since V has dimension m, there exist m linearly independent vec- 
tors x',x’,...,x” which span V. Let y!,y’,...,y” be a set of n vectors in V, 
with n> m. Since x',x’,...,x” span V, all the y’ can be written as linear 


combinations of these vectors. That is to say, there exist constants a,, 1 <i 
<n, 1<j<m such that 
1 1 2 m 
Y =Q,,X +Q,.X° +... +4a,,,x 
2 | 2 m 
Yo =Q5)X Fax +... $a,,X 
(4) 
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To determine whether y',y’,...,y” are linearly dependent or linearly inde- 
pendent, we consider the equation 


ay't+toy'+...+c¢y"=0. (5) 
Using (4) we can rewrite (5) in the form 
O=cy'+oy?t+...+cy" 
DX +(€) Gyo + «0. + Gyo) X” 


+... 4+(6)0),, + -2) FC; )X”- 


navunam 


=(c,a,,+...+0¢,a 


This equation states that a linear combination of x!,x’,...,x’” is zero. Since 
x!,x’,...,x” are linearly independent, all these coefficients must be zero. 


Hence, 
CAj,; + C,a,, +... + ¢,a,, =0 


C1Ayy + CA) +... + 6,4, =0 


(6) 


Cm F CoQ ayy F 00. FC An = 9. 


Now, observe that the system of Equations (6) is a set of m homogeneous 
linear equations for n unknowns c¢),Cc),...,c,, with n>m. By Lemma 1, 
these equations have a nontrivial solution. Thus, there exist constants 
C1,C2,+++,C,, not all zero, such that c,y'+ cy*+ ... +c,y”=0. Consequently, 
y',y’,...,y” are linearly dependent. O 


We are now in a position to prove Theorem 2. 


PROOF OF THEOREM 2. If 7 linearly independent vectors span V, then, by 
the definition of dimension, dimV <n. By Lemma 2, n<dimV. Hence, 
dim V =n. O 


Example 5. The dimension of R’ is n since e!,e’,...,e” are n linearly inde- 
pendent vectors which span R”. 


Example 6. Let V be the set of all 33 matrices 
Qi, 42 43 
A= {4% 42 43], 
43, 432 433 
and let E,, denote the matrix with a one in the ith row, jth column and 
zeros everywhere else. For example, 


0 0 O 
E,;= 0 0 | 
0 0 0 
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To determine whether these matrices are linearly dependent or linearly in- 
dependent, we consider the equation 


3 0 0 0 
» c,;E;=0=|0 0 O}. (7) 
ij=l 0 0 O 


Now, observe that the left-hand side of (7) is the matrix 


1 O O 0 1 0 0 0 O Ciyy C2 C43 
C110 O Of Fey!/0 O OFF... F¢33;0 O OO} = [21 C22 C23 
0 0 O 0 0 O 0 O 1 C3, C30 C33 


Equating this matrix to the zero matrix gives c,, =0,c). =0,...,¢33 =0. 
Hence the 9 matrices E,, are linearly independent. Moreover, these 9 
matrices also span V since any matrix 


Qi, 42 3 
A=]4% 4&2 93 
43; 432 433 


can obviously be written in the form A= > * GE Hence dimV=9. 


Definition. If a set of linearly independent vectors span a vector space V, 
then this set of vectors is said to be a basis for V. A basis may also be 
called a coordinate system. For example, the vectors 


l 0 0 0 
i 0 2 ] 3_|0 4_|0 
— — 9 = d — 
e 0 e 0 e and e 0 
0 0 0 l 
are a basis for R*. If 
XxX} 
_ | *2 
x= x,’ 
X4 


then x= x,e' + x,e7 + x,e° + x,e*, and relative to this basis the x, are 
called “components” or “coordinates.” 


Corollary. In a finite dimensional vector space, each basis has the same num- 
ber of vectors, and this number is the dimension of the space. 


The following theorem is extremely useful in determining whether a set 
of vectors is a basis for V. 
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Theorem 3. Any n linearly independent vectors in an n dimensional space V 
must also span V. That is to say, any n linearly independent vectors in an n 
dimensional space V are a basis for V. 


Proor. Let x',x’,...,x” be n linearly independent vectors in an n dimen- 
sional space V. To show that they span V, we must show that every x in V 
can be written as a linear combination of x!,x’,...,x”. To this end, pick 
any x in V and consider the set of vectors x,x!,x*,...,x”. This is a set of 
(n+ 1) vectors in the n dimensional space V; by Lemma 2, they must be 
linearly dependent. Consequently, there exist constants c,c,,C>,...,C,, not 
all zero, such that 


cxtc,x'+cx*+... +0,x"=0. (8) 


Now c<0, for otherwise the set of vectors x!,x’,...,x” would be linearly 
dependent. Therefore, we can divide both sides of (8) by c to obtain that 


Cy 5) Ch 
x= —— x! -— —x?- |... —- — x", 
C C C 


Hence, any 7 linearly independent vectors in an n dimensional space V 
must also span V. O 


Example 7. Prove that the vectors 


1_{ | 2 | 
({) and =(_| 
form a basis for R?. 


Solution. To determine whether x! and x? are linearly dependent or lin- 
early independent, we consider the equation 


oxttow=c(t}+oa(_l)=(0), (9) 


Equation (9) implies that c; +c, =0 and c, —c,=0. Adding these two equa- 
tions gives c, =0 while subtracting these two equations gives c,=0. Conse- 
quently, x' and x? are two linearly independent vectors in the two dimen- 
sional space R*. Hence, by Theorem 3, they must also span V. 


EXERCISES 


In each of Exercises 1-4, determine whether the given set of vectors is lin- 
early dependent or linearly independent. 


(h(a Ca) 8) (9 
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10. 


3.3. Dimension of a vector space 


=") 


. Let V be the set of all 22 matrices. Determine whether the following sets of 


matrices are linearly dependent or linearly independent in V. 


(19) (8 9) (9 8) ame (8 1) 


(1 ob (ot) Go) a (1 a} 


. Let V be the space of all polynomials in ¢ of degree <2. 


(a) Show that dim V=3. 

(b) Let p,, p, and p; be the three polynomials whose values at any time ¢ are 
(t— 1)’, (t—2)?, and (t—1) (t—2) respectively. Show that p,, p2, and p; are 
linearly independent. Hence, conclude from Theorem 3 that p,, p2, and p3; 
form a basis for V. 


. Let V be the set of all solutions of the differential equation d*y /dt*—y =0. 


(a) Show that V is a vector space. 
(b) Find a basis for V. 


. Let V be the set of all solutions of the differential equation (d*y /dt*)+y =0 


which satisfy y(0)=0. Show that V is a vector space and find a basis for it. 


. Let V be the set of all polynomials p(t)=a + a,t+a,t* which satisfy 


p(0) +2p’(0) + 3p” (0) =0. 
Show that V is a vector space and find a basis for it. 
Let V be the set of all solutions 
x(t) 
x= X(t) 
x3(t) 


of the differential equation 


0 1 O 
x=[0 0 1 |x. 
6 -I1 6 


Show that 
e! e2! ert 
x()=let|, x=] 262], and x(t)=| 3¢% 
e! 4e! Qe! 


form a basis for V. 
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11. 


12. 


13. 


14. 


15. 


16. 


17. 
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Let V be a vector space. We say that W is a subspace of V if W is a subset of V 
which is itself a vector space. Let W be the subset of R? which consists of all 
vectors 


which satisfy the equations 


X,+X2+2x,=0 
2X;—X2+ x,=0 
6x, +6x,;=0. 


Show that W is a subspace of R? and find a basis for it. 


Prove that any n vectors which span an n dimensional vector space V must be 
linearly independent. Hint: Show that any set of linearly dependent vectors 
contains a linearly independent subset which also spans V. 


Let v',v’,...,v” be n vectors in a vector space V. Let W be the subset of V 
which consists of all linear combinations c,v'+c,v? +... +c¢,v" of v',y’,...,Vv". 
Show that W is a subspace of V, and that dimW< n. 


Let V be the set of all functions f(¢) which are analytic for |t|<1, that is, f(d) 
has a power series expansion f(t)= a 9+ a,t+a)t?+... which converges for |¢| 
<1. Show that V is a vector space, and that its dimension 1s infinite. Hint: V 
contains all polynomials. 


Let v!,v2,...,v” be m linearly independent vectors in an n dimensional vector 
space V, with n>m. Show that we can find vectors v"*!,...,v” so that 
vijv’,...,.w%v"+! ...,v" form a basis for V. That is to say, any set of m linearly 
independent vectors in an n>m dimensional space V can be completed to 
form a basis for V. 


Find a basis for R? which includes the vectors 


() = () 


(a) Show that 


are linearly independent in R?. 


(b) Let 


x} 
X=[ xX) | =x,e! + x,e7+ x3e°. 
x3 


3.4 Applications of linear algebra to differential equations 


Since v',y’, and v° are linearly independent they are a basis and x=y,v' + 
y2v? + y3v>. What is the relationship between the original coordinates x; and 
the new coordinates y,? 

(c) Express the relations between coordinates in the form x = By. Show that the 
columns of B are v', y*, and v’. 


3.4 Applications of linear algebra 
to differential equations 


Recall that an important tool in solving the second-order linear homoge- 
neous equation (dy /dt*) + p(t)(dy /dt)+ q(t)y =0 was the existence- 
uniqueness theorem stated in Section 2.1. In a similar manner, we will 
make extensive use of Theorem 4 below in solving the homogeneous linear 
system of differential equations 


xy Qype. A 


A _ay = x=l|:|, Aegl: | (1) 


dt 


The proof of this theorem will be indicated in Section 4.6. 


Theorem 4 (Existence—uniqueness theorem). There exists one, and only 
one, solution of the initial-value problem 


x] 

dx 0 x} 

Ae X(t)) =x = |: (2) 
x? 


Moreover, this solution exists for —0 <t<oo. 


Theorem 4 is an extremely powerful theorem, and has many implica- 
tions. In particular, if x(7) is a nontrivial solution, then x(#)#90 for any ¢. 
(If x(¢*)=0 for some f*, then x(t) must be identically zero, since it, and the 
trivial solution, satisfy the same differential equation and have the same 
value at t= ¢*.) 

We have already shown (see Example 7, Section 3.2) that the space V of 
all solutions of (1) is a vector space. Our next step is to determine the di- 
mension of V. 


Theorem 5. The dimension of the space V of all solutions of the homogeneous 
linear system of differential equations (1) is n. 
PROOF. We will exhibit a basis for V which contains n elements. To this 
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end, let @/(t), 7=1,...,n be the solution of the initial-value problem 


0 
ax 0 
—=Ax, x(0)=e/=/ 1 | —/th row. (3) 
dt 

0 

0 


For example, '(f) is the solution of the differential equation (1) which 
satisfies the initial condition 


'(0)=e'= 
0 
Note from Theorem 4 that $/(f) exists for all ¢ and is unique. To de- 


termine whether $',?,...,@” are linearly dependent or linearly indepen- 
dent vectors in V, we consider the equation 


Co'+0,67+...+¢,6"=0 (4) 


where the zero on the right-hand side of (4) stands for the zero vector in V 
(that is, the vector whose every component is the zero function). We want 


to show that (4) implies c,; =c,=... =c, =0. Evaluating both sides of (4) at 
t=(Q gives 
0 
0 
c,6' (0) + c97(0) +... +¢,6"(0)=| - | =0 
0 
or 


cje'+ce*+...+ce"=0. 


Since we know that e',e’,...,e” are linearly independent in R”, c,=c, 
=... =c, =0. We conclude, therefore, that @',’,...,@” are linearly inde- 
pendent vectors in V. 

Next, we claim that $',67,...,@” also span V. To prove this, we must 
show that any vector x in V (that is, any solution x(t) of (1)) can be written 
as a linear combination of $',¢’,...,@”. To this end, pick any x in V, and 
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let 


be the value of x at t=0 (x(0)=c). With these constants c),c),...,¢,, Con- 
struct the vector-valued function 


$(t)=c\o'(t) +c? (t) +... 6,6" (2). 


We know that $(?) satisfies (1) since it is a linear combination of solutions. 
Moreover, 


(0) = c,p' (0) + c,67(0) + ... +c," (0) 


| 0 0 C| 
0 | 0 C2 

=C,| . |te) . [+..ta} . [=]. =x(0). 
0 0 1} le 


Now, observe that x(t) and (Z) satisfy the same homogeneous linear sys- 
tem of differential equations, and that x() and $(¢) have the same value at 
t=(0. Consequently, by Theorem 4, x(t) and #(¢) must be identical, that is 


x(t1)=(t) =c,'(t) + e997 (1) +... +0,6" (2). 
Thus, ¢!,7,...,@” also span V. Therefore, by Theorem 2 of Section 3.3, 
dim V=n. ‘a 


Theorem 5 states that the space V of all solutions of (1) has dimension 
n. Hence, we need only guess, or by some means find, n linearly indepen- 
dent solutions of (1). Theorem 6 below establishes a test for linear indepen- 
dence of solutions. It reduces the problem of determining whether n solu- 
tions x!,x’,...,x” are linearly independent to the much simpler problem of 
determining whether their values x'(t,),X?(fo),...,X"(¢) at an appropriate 
time fy are linearly independent vectors in R’”. 


Theorem 6 (Test for linear independence). Let x'!,x’,...,x* be k solutions 
of x=Ax. Select a convenient ty. Then, x',...,x* are linear independent 
solutions if, and only if, x'(to), X°(to), ...,X“ (to) are linearly independent 
vectors in R". 


Proor. Suppose that x!,x?,...,x* are linearly dependent solutions. Then, 
there exist constants c,c3,...,c,, not all zero, such that 
cx! +¢x°+... +¢,x* =0. 
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Evaluating this equation at t=) gives 


¢,x"(t9) + C)X*(to) +... +6, x" (fp) =| - 
0 


Hence x!(t9),x7(to),...,X*“ (to) are linearly dependent vectors in R’. 

Conversely, suppose that the values of x!,x’,...,x* at some time f, are 
linearly dependent vectors in R”. Then, there exist constants c,,C3,...,C;,; 
not all zero, such that 


Oo © 


CX" (fo) + CyX"(fo) +... HE,X*(tg) =|. | =O. 
0 


With this choice of constants c,,c>,...,¢,, construct the vector-valued func- 
tion 


(t)=c,x!(t)+ cx*(t) +... +¢,x*(2). 


This function satisfies (1) since it is a linear combination of solutions. 
Moreover, $(f))=0. Hence, by Theorem 4, (t)=0 for all ¢. This implies 
that x!,x?,...,x* are linearly dependent solutions. Oo 


Example 1. Consider the system of differential equations 


dx, 
—> = xX 
dt dx _( 0 \) -(%) 
dx, Or "dt =(_' 2) **\x/ (9) 
This system of equations arose from the single second-order equation 
d*y |, dy 
We +277 +y =(0 (6) 


by setting x, =y and x,=dy/dt. Since y,(t)=e ‘ and y,(t)=te~‘ are two 
solutions of (6), we see that 
vil e' 
m= ( — =) 


2/0) te~' 
r0-( 4] 


and 
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are two solutions of (5). To determine whether x! and x? are linearly de- 
pendent or linearly independent, we check whether their initial values 


=(_1) 
and 
o-(° 


are linearly dependent or linearly independent vectors in R*. Thus, we con- 
sider the equation 


6 x!(0) + 65x"(0) =( ce, )= (0): 


This equation implies that both c, and c, are zero. Hence, x'(0) and x?(0) 
are linearly independent vectors in R?. Consequently, by Theorem 6, x!(f) 
and x*(t) are linearly independent solutions of (5), and every solution x(f) 
of (5) can be written in the form 


xa(2PJao( ei) ra( (ttc) 


(arene (7) 


(c,—¢,—et)e7 


Example 2. Solve the initial-value problem 


a7(- 2s x@=(7) 


Solution. From Example 1, every solution x/t) must be of the form (7). The 
constants c, and c, are determined from the initial conditions 


(})=x@=(.,".,}. 


Therefore, c,=1 and c,=1+c,=2. Hence 


_[ x, (2) _ (1+2t)e~‘ 
0-0] op) 


Up to this point in studying (1) we have found the concepts of linear al- 
gebra such as vector space, dependence, dimension, basis, etc., and vector- 
matrix notation useful, but we might well ask is all this other than simply 
an appropriate and convenient language. If it were nothing else it would be 
worth introducing. Good notations are important in expressing mathemati- 
cal ideas. However, it is more. It is a body of theory with many applica- 
tions. 
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In Sections 3.8—3.10 we will reduce the problem of finding all solutions 
of (1) to the much simpler algebraic problem of solving simultaneous linear 
equations of the form 


A,X, tayyxX t+... ta,,x, =), 
Ay X, + AyyX +... $5, x, = 5, 


AnjX, FAX t+... +a,,X, =), 


Therefore, we will now digress to study the theory of simultaneous linear 
equations. Here too we will see the role played by linear algebra. 


EXERCISES 


In each of Exercises 1—4 find a basis for the set of solutions of the given 
differential equation. 


1. i=( J _ | )x (Hint: Find a second-order differential equation satis- 
fied by x,(Z).) 


0 1 O 
2. ‘(1 0 1 (Hint: Find a third-order differential equation satis- 


2 -1 2 
fied by x,(t).) 
1 0 0 
3, x=(/ a 4.x=11 1 olx 
2 | 11 


For each of the differential equations 5—9 determine whether the given 
solutions are a basis for the set of all solutions. 


--( 9 —I1)\,. -{ e') yanele' 
5. x (_§ as x'(2) (_“), x2(t) (<-') 
4 2 2 et 0 oot 
—] 3 1 }s x()= a x*(t) = «| x(1)= | 0 | 
1 -1 §5 0 eft oot 


—-3 -2 3 e 2! et —e~ 4 
7. -| l 0 -5}s x)= e-4|,xX(N=| —e2|,x(D=] en 4 
l 


—2 —] e 2! er! e~* 
—5 2 -2 e 3! 0 
8.X=| 1 -4 -1)x; x'(=]| 0-3], x(D=| e7> 
—] 1 —-6 0 eat 


3.5 The theory of determinants 


—5 2 -2 e > 
9x=( 1 -4 -1]x; x'(=| 2-31, 


—1 1 —-6 0 
0 ep eo™ 2e~7 
x(N=le |, = e~ , x)= e~> 
e~! ewit et 4Qe77 


10. Determine the solutions $',¢’,...,@" (see proof of Theorem 5) for the system 
of differential equations in (a) Problem 5; (b) Problem 6; (c) Problem 7. 


11. Let V be the vector space of all continuous functions on (— 00, 00) to R” (the 
values of x(t) lie in R"). Let x!,x’,...,x” be functions in V. 
(a) Show that x!(¢9),...,x"(fp) linearly independent vectors in R” for some fy 
implies x!,x?,...,x” are linearly independent functions in V. 
(b) Is it true that x!(t9),...,x"(tp) linearly dependent in R” for some fp implies 
x! x?,...,x” are linearly dependent functions in V? Justify your answer. 


12. Let u be a vector in R”(u0). 
(a) Is x(¢)=¢u a solution of a linear homogeneous differential equation x = Ax? 
(b) Is x(t) = eu? (c) Is x(t)=(e'—e7")u? 
(d) Is x(t)=(e' +e "Ju? (e) Is x(t) =(e*" + er2‘)u? 
(f) For what functions ¢(f) can x(t)=¢(d)u be a solution of some x= Ax? 


3.5 The theory of determinants 


In this section we will study simultaneous equations of the form 
A,X, FAX +... +a,,x, =), 


Ay) X + AyX,+... tay, x, = 5, 


(1) 


Aj X, +A, X_ +... +4,,X, = b,. 


Our goal is to determine a necessary and sufficient condition for the sys- 
tem of equations (1) to have a unique solution x,,x,...,X,- 

To gain some insight into this problem, we begin with the simplest case 
n=2. If we multiply the first equation a,,x,+4,.x,= 5, by a,,, the second 
equation a,x, +d ,x,=b, by a,,, and then subtract the former from the 
latter, we obtain that 

(41 1@z2 — 412421 )X2 = a,b, — a2,)). 

Similarly, if we multiply the first equation by a,,, the second equation by 
a;, and then subtract the latter from the former, we obtain that 


(441449 — 4129) )X 1 = Ay2b, — a,b). 
Consequently, the system of equations (1) has a unique solution 
yb, — aynb, aby — agyby 


1 _ ’ 2~ _ 
1142 — @)24) 11422 — 4124 
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if the number a,,43) — a,,4>, 18 unequal to zero. If this number equals zero, 
then we may, or may not, have a solution. For example, the system of 
equations 


X)—-X,=1, 2x,—2x,=4 


obviously has no solutions, while the system of equations 


has an infinity of solutions x,=c,x,=c for any number c. For both these 
systems of equations, 


1499 — G74, = 1(—2) —(—1)2=0. 
The case of three equations 
AX, + AyyXy + Ay3X3= by 
Ay 1X + Ay)X2 + A73X4= 5, (2) 
3X1 + A3)X4 + a33x5= b; 


in three unknowns x,,x,,x, can also be handled quite easily. By eliminat- 
ing one of the variables from two of the equations (2), and thus reducing 
ourselves to the case n=2, it is possible to show (see Exercise 1) that the 
system of equations (2) has a unique solution x,,x,,x, if, and only if, the 
number 


Ay 1Ay7433 + Ay 7A7303) + A134) )A3) — Ay4Ay9A3) — y9A74443—A4\Ay443, (3) 


is unequal to zero. 
We now suspect that the system of equations (1), which we abbreviate 
in the form 


xX by 


has a unique solution x if, and only if, a certain number, which depends on 
the elements a, of the matrix A, is unequal to zero. We can determine this 
number for n=4 by eliminating one of the variables from two of the equa- 
tions (1). However, the algebra is so complex that the resulting number is 
unintelligible. Instead, we will generalize the number (3) so as to associate 
with each system of equations Ax=b a single number called determinant A 
(detA for short), which depends on the elements of the matrix A. We will 
establish several useful properties of this association, and then use these 
properties to show that the system of equations (4) has a unique solution x 
if, and only if, detA+0. 

If we carefully analyze the number (3), we see that it can be described in 
the following interesting manner. First, we pick an element a,, from the 
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first row of the matrix 


A=1/4; 422 Qy3]. 


43; 432 433 


This element can be either a,),a@,), or a,3. Then, we multiply a,, by an ele- 
ment a), from the second row of A. However, j, must not equal j,. For ex- 
ample, if we choose a,, from the first row of A, then we must choose either 
ay, OF a, from the second row of A. Next, we multiply these two numbers 
by the element in the third row of A in the remaining column. We do this 
for all possible choices of picking one element from each row of A, never 
picking from the same column twice. In this manner, we obtain 6 different 
products of three elements of A, since there are three ways of choosing an 
element from the first row of A, then two ways of picking an element from 
the second row of A, and then only one way of choosing an element from 
the third row of A. Each of these products a,, a), a,, is multiplied by +1, 
depending on the specific order j, j,j,;. The products a,,a,,a,, with 
CG, J2J3) = (123), (231), and (312) are multiplied by +1, while the products 
ay; Az, a3, with (J; J2/3) = (321), (213), and (132) are multiplied by —1. Fi- 
nally, the resulting numbers are added together. 

The six sets of numbers (123), (231), (312), (321), (213), and (132) are 
called permutations, or scramblings, of the integers 1, 2, and 3. Observe that 
each of the three permutations corresponding to the plus terms requires an 
even number of interchanges of adjacent integers to unscramble the per- 
mutation, that is, to bring the integers back to their natural order. Simi- 
larly, each of the three permutations corresponding to the minus terms re- 
quires an odd number of interchanges of adjacent integers to unscramble 
the permutation. To verify this, observe that 


23132134123 (2 interchanges) 

312132123 (2 interchanges) 
321312132123 (3 interchanges) 

213-123 and 132-123 (1 interchange each). 


This motivates the following definition of the determinant of annxXn 
matrix A. 


Definition. 
det A= » Gir nV in Ty, (5) 
Jloeees Jn 
where ¢ ,_, =1 if the permutation (/,/.../,) is even, that is, if we can 


bring the integers /,...j, back to their natural order by an even number 
of interchanges of adjacent integers, ande, ,_, = — 1 if the permutation 
Ci J2---J,) 1s odd. In other words, pick an element a,, from the first row 
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of the matrix A. Then, multiply it by an element a,, from the second 
row of A, with j,~/,. Continue this process, going from row to row, and 
always picking from a new column. Finally, multiply the product 
Ay; Az;,...dy by +1 if the permutation (j,/,...j,) is even and by —1 if 
the permutation is odd. Do this for all possible choices of picking one 
element from each row of A, never picking from the same column twice. 
Then, add up all these contributions and denote the resulting number 
by detA. 


Remark. There are many different ways of bringing a permutation of the 
integers 1,2,...,1 back to their natural order by successive interchanges of 
adjacent integers. For example, 

4312 4132-5 1432-14235 1243-51234 
and 

431234123 142-3124 1324 1234. 
However, it can be shown that the number of interchanges of adjacent in- 
tegers necessary to unscramble the permuation /,...j, is always odd or al- 
ways even. Hence ¢,__, is perfectly well defined. 


Example 1. Let 
ay, a 
Aa( 7 ia} 
(a a2 

In this case, there are only two products a,,a,, and a,,a,, that enter into 
the definition of detA. Since the permutation (12) is even, and the per- 
mutation (21) is odd, the term a,,a,, is multiplied by +1 and the term 
Ay, 18 multiplied by — 1. Hence, detA=a,,a,,— a, 45. 


1 1 1 
det} 3 2 |i. 
-1 -1 2 
Solution. A shorthand method for computing the determinant of a 3x3 


matrix is to write the first two columns after the matrix and then take 
products along the diagonals as shown below. 


Example 2. Compute 


I a a eae 
1 414 NIX X 1 
det} 3 2 1l/= 3 2 y) Pp 2 
“1-1 2) 57 XX LN, 

N+ 


= 1-2-241-1-(—1)4+1-3(—1)—(-1)-2-1—(-1)-1-1-2-3-1= 3. 


If A is ann Xn matrix, then detA will contain, in general, n! products of 
n elements. The determinant of a 4x4 matrix contains, in general, 24 
terms, while the determinant of a 10x 10 matrix contains the unacceptably 
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large figure of 3,628,800 terms. Thus, it is practically impossible to com- 
pute the determinant of a large matrix A using the definition (5) alone. The 
smart way, and the only way, to compute determinants is to (i) find special 
matrices whose determinants are easy to compute, and (11) reduce the prob- 
lem of finding any determinant to the much simpler problem of computing 
the determinant of one of these special matrices. To this end, observe that 
there are three special classes of matrices whose determinants are trivial to 
compute. 


1. Diagonal matrices: A matrix 


a, O O.... O 

0 a, O 0 
A= , 

0 0 OO .. +a 


mn 


whose nondiagonal elements are all zero, is called a diagonal matrix. Its 
determinant is the product of the diagonal elements a,,,a,,...,4,,. This 
follows immediately from the observation that the only way we can choose 
a nonzero element from the first row of A is to pick a,,. Similarly, the only 
way we can choose a nonzero element from the jth row of A is to pick a,. 
Thus the only nonzero product entering into the definition of detA is 
Q,14>7---Q,n» and this term is multiplied by +1 since the permutation 
(12...n) is even. 


2. Lower diagonal matrices: A matrix 


ay, 90 0 

Qa, 47 0 
A= 

ani an2 Ann 


whose elements above the main diagonal are all zero, is called a lower di- 
agonal matrix, and its determinant too is the product of the diagonal ele- 
ments a,,,..-,4,,- LO prove this, observe that the only way we can choose a 
nonzero element from the first row of A is to pick a,,. The second row of A 
has two nonzero elements, but since we have already chosen from the first 
column, we are forced to pick a), from the second row. Similarly, we are 
forced to pick a, from the jth row of A. Thus, det A= @j)@))...4),. 


3. Upper diagonal matrices: A matrix 


Qi Qi? eee Qin 

0 ay A>, 
A= . . 

0 0 a 
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whose elements below the main diagonal are all zero is called an upper di- 
agonal matrix and its determinant too is the product of the diagonal ele- 
ments a,,,...,4,,- To prove this, we proceed backwards. The only way we 
can choose a nonzero element from the last row of A is to pick a,,. This 
then forces us to pick a,_,,,_, from the (n—1)st row of A. Similarly, we 
are forced to pick a, from the jth row of A. Hence detA=a,,,...€)4)). 


We now derive some simple but extremely useful properties of determi- 
nants. 


Property 1. If we interchange two adjacent rows of A, then we change the 
sign of its determinant. 


Proor. Let B be the matrix obtained from A by interchanging the kth and 
(k + 1)st rows. Observe that all of the products entering into the definition 
of det B are exactly the same as the products entering into the definition of 
det A. The only difference is that the order in which we choose from the 
columns of A and B is changed. For example, let 


| 3 4 
A=|2 —-1 0 
l 2 -!1 
and 
l 3 4 
B=| 1 2 -Ili}. 
2 —-!1 0 


The product 4X22 appears in detA by choosing first from the first row, 
third column; then from the second row, first column; and finally from the 
third row, second column. This same product appears in det B by choosing 
first from the first row, third column; then from the second row, second 
column; and finally from the third row, first column. More generally, the 
term 


ay, i Ty Ak + Lager’ ** Any, 
in detA corresponds to the term 


Bij Dig, het tig 2 On, 

in detB. The sign of the first term is determined by the permutation 
Ci.» Jp Je+1++J,) While the sign of the second term is determined by the 
permutation (j,...j,4,Jj,---J,). Since the second permutation is obtained 
from the first by interchanging the kth and (k + 1)st elements, we see that 
these two terms have opposite signs. Hence det B= — det A. a 


Property 2. If we interchange any two rows of A, then we change the sign 
of its determinant. 
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PrRooF. We will show that the number of interchanges of adjacent rows re- 
quired to interchange the ith and jth rows of A is odd. Property 2 will then 
follow immediately from Property 1. To this end, assume that / is greater 
than i. We need j—i successive interchanges of adjacent rows to get the /th 
row into the ith place, and then j—i—1 successive interchanges of adjacent 
rows to get the original ith row into the jth place. Thus the total number of 
interchanges required is 2(j—i)—1, and this number is always odd. CO 


Property 3. If any two rows of A are equal, then detA=0. 

Proor. Let the ith and jth rows of A be equal, and let B be the matrix ob- 
tained from A by interchanging its ith and jth rows. Obviously, B=A. But 
Property 2 states that det B= —detA. Hence, det A= — detA if two rows of 
A are equal, and this is possible only if det A=0. O 
Property 4. detcA=c" detA. 

PROOF. Obvious. C 
Property 5. Let B be the matrix obtained from A by multiplying its ith row 

by a constant c. Then, detB=cdetA. 
ProoF. Obvious. a 


Property 6. Let A’ be the matrix obtained from A by switching rows and 
columns. The matrix A? is called the transpose of A. For example, if 


1 3 2 1 6 -1 
A={ 6 9 4], then A7=/3 9 2]. 
—-1 2 7 2 4 7 


A concise way of saying this is (A’),,=a,. Then, 
det A’ =detA. 


PROOF. It is clear that all the products entering into the definition of detA 
and detA’ are equal, since we always choose an element from each row 
and each column. The proof that these products have the same sign, 
though, is rather difficult, and will not be included here. (Frankly, 
whenever this author teaches determinants to his students he wishes that 
he were king, so that he could declare Property 6 true by edict.) 0 


Remark. It follows immediately from Properties 2, 3, and 6 that we change 
the sign of the determinant when we interchange two columns of A, and 
that detA=0 if two columns of A are equal. 


Property 7. If we add a multiple of one row of A to another row of A, then 
we do not change the value of its determinant. 
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PROOF. We first make the crucial observation that detA is a linear func- 
tion of each row of A separately. By this we mean the following. Write the 


matrix A in the form 


Qi, y2 Qin a! 
42; 422 Q2, a2 
A= . = ; 
Qn] ano Ann a” 
where 
1_ 
Ae = (11,412)---, yy) 
2_ 
A” = (A7),A7,.--,z,) 
a 
a = (G15 Gp29+++> Ann) 
Then 
a! a! 
det| cat | =cdet| gk 
q”" q” 
and 
a! a! a! 


det| at 4p | =det| g¢ |+det] 5 
a” a” a” 


For example, 


l 5 7 l 5 7 l 
det| 8 3 7|=det| 4 1 9|+det!] 4 
6 -1 0 6 -1 0O 6 


(i) 


(ii) 


5 7 
2 —2 


—1 0 


since (4, 1,9)+(4,2, —2)=(8,3,7). Now Equation (i) is Property 5. To de- 
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rive Equation (11), we compute 


a 
det} a*+p/= >> Figo (Og, + Bi) = By 
e Jloeers In 
gq" 
= > Gi Ay + yy + a8 &.. int Di oe Ay, 
Jise++dIn 


=det| «| +det! 5 


a” a” 
Property 7 now follows immediately from Equation (11), for if B is the 


matrix obtained from A by adding a multiple c of the Ath row of A to the 
jth row of A, then 


a a a a 
a/ + ca* a/ ca* aX 
det B= det : =det|: |+det] : | =detA+cdet 
a‘ aX aX aX 
a’ a’ a’ a’ 
But 
a! 
ak 
det}; : | =0 
ak 
a’ 
since this matrix has two equal rows. Hence, det B= det A. O 
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Remark 1. Everything we say about rows applies to columns, since det A? 
=detA. Thus, we do not change the value of the determinant when we add 
a multiple of one column of A to another column of A. 


Remark 2. The determinant is a linear function of each row of A sep- 
arately. It is not a linear function of A itself, since, in general, 


detcA#cdetA and det(A+B)#detA+detB. 


For example, if 


1 -2 
A= 
(3 73) 
and 
_{-1 7 
B ( 0 5): 
then 
_an(0  5\_ 
det(A +B) det 0 3] 0, 


while detA+detB=3—9= — 6. 


Property 7 is extremely important because it enables us to reduce the 
problem of computing any determinant to the much simpler problem of 
computing the determinant of an upper diagonal matrix. To wit, if 


Qi, 442 Qin 

az, 4 Arn, 
A = e e 

an} an2 Ann 


and a,,#0, then we can add suitable multiples of the first row of A to the 
remaining rows of A so as to make the resulting values of a,,,...,a,, all 
zero. Similarly, we can add multiples of the resulting second row of A to 
the rows beneath it so as to make the resulting values of a3,,...,a,, all zero, 
and so on. We illustrate this method with the following example. 


Example 3. Compute 


1 -l 2 3 

> 2 0 2 
detyy yy af 

1 2 3 0 
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Solution. Subtracting twice the first row from the second row; four times 
the first row from the third row; and the first row from the last row gives 


1 -l1 2 3 i -1 2 3 
2 2 #O 2]_4,)0 4 -4 -4 
det}, yy) 7 | = Oe 5 -9 -2B 
1 2 3 0 0 3 1 -3 

1 -l 2 3 

_ 0 1 -1 =! 

4det}9 5g 3 

0 3 | -3 


Next, we subtract five times the second row of this latter matrix from the 
third row, and three times the second row from the fourth row. Then 


1 -1 2 3 

2 2 0 2) _ 0 1 -1l -1 
det 4 1-1-1 =4det . 

l 2 3 0 


i -l 2. 3 i -l1 2. 3 
2 2 O 2. 0 1 -1 -1 
dete dy 117499 og 8 
1 2 3. 0 0 0 O -8 


= 4(—4)(—8) =128. 


(Alternately, we could have interchanged the third and fourth columns of 
the matrix 


1 —-1 2 3 
0 I -1l -1l 
0 0 -4 -8 
) 0 4 0 


to yield the same result.) 


Remark 1. If a,,=0 and a,,#0 for some j, then we can interchange the 
first and jth rows of A so as to make a,, #0. (We must remember, of 
course, to multiply the determinant by —1.) If the entire first column of A 
is zero, that is, if a,,=a,,;=...=a,,=0 then we need proceed no further 
for detA=0. 
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Remark 2. In exactly the same manner as we reduced the matrix A to an 
upper diagonal matrix, we can reduce the system of equations 


AyjX, FAX +... +a),X, =H, 
Ay) X) + AyyX_ +... + 45,X, = dD, 


Anyi X + Aj Xt... +A X, =O, 
to an equivalent system of the form 


Cy Xp HCpyXy +... +C),X%, = 4, 
Cy9Xy t+... Cy, Xx, =, 


We can then solve (if c,,,0) for x, from the last equation, for x,_, from 
the (n — 1)st equation, and so on. 


Example 4. Find all solutions of the system of equations 
Xx 1 + X> + X3 = ] 
— xX, +X, + x,=2 
2X) — Xp + x3=3. 
Solution. Adding the first equation to the second equation and subtracting 
twice the first equation from the third equation gives 
X, + x,+x,=1 
Next, adding 3 the second equation to the third equation gives 
X, + x,+x,=1 
2X, +2x,=3 


Consequently, x,= 4, x,=(3—4)/2=—2, and x,=1+2-P=—-}. 


EXERCISES 


1. Show that the system of equations 
AX, + A12X7 + Aj3x3 =D, 
Ay\X1 + A72X2 + A73X3 = bz 
43)X1 + A32X2 + A33X3= 5; 
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has a unique solution xj, x>,x3 if, and only if, 


Qi, 412, 3 
det] 42; 4. 43/0. 
Q3; 432 433 


Hint: Solve for x, in terms of x, and x; from one of these equations. 


2. (a) Show that the total number of permutations of the integers 1,2,...,” 1s 
even. 
(b) Prove that exactly half of these permutations are even, and half are odd. 


In each of Problems 3-8 compute the determinant of the given matrix. 


1 0 2 1 ¢« # 
3.11 2 5 444 72 |] 
6 8 0 {2 t l 
0 a b QO 2 -1 6 
—a 0 c O | 0 1 -!1 
. —b -c 0 QO 6 ] 3 0 2 
0 0 0 1 1 -1 1 0 
1 1 1 1 =«21 
mp 10002 
7. 7 7 810 1 0 0 3 
] l 0 1 0 
0 2 6 1 oO 1 0 4 
0 0 0 1 5 


9. Without doing any computations, show that 


a be e b h 
det!d e fl=detid a gl. 


g hik f c ik 


In each of Problems 10-15, find all solutions of the given system of equa- 
tions 


10. Xx, + x2— x3=0 11. X,+X2+x3=6 


2x, + x3=14 
X2+X3= 13 


12. X1+X2+x3z=0 
X)—X2—-x,=0 
X»+x,=0 


13. 


X, 7X2 X33 —4 
Xx +X3= —] 
X,+ X2+ x3—-X4=1 


X,+2x,-2x3+x,=1 
X,+3x,—3x3-xX4,=1 


xX, +4x,-—4x3-—x4=1 


14. Xy +X +2x3-—X4=1 15. 
Xy— XQ +2x44+X4=2 
X,+X2+2x3-x4,=1 


— X)—X_—2x3,+x,=0 


Xp XQ+XZ+ X4=0 
xX, +2x,—x3+3x,=0 
3x, +3x2—x3+7x4=0 
—X,+2x,+x3- x4=0 
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3.6 Solutions of simultaneous linear equations 


In this section we will prove that the system of equations 


Qi eee Qa; 


,b=]:] (a) 


has a unique solution x if detA+0. To this end, we define the product of 
two nXn matrices and then derive some additional properties of determi- 
nants. 


Definition. Let A and B be n Xn matrices with elements a, and b, respec- 
tively. We define their product AB as the n Xn matrix C whose ij ele- 
ment c, is the product of the ith row of A with the jth column of B. 
That is to say, 


n 
Cy = Ss Op Dy: 
k=1 


Alternately, if we write B in the form B=(b',b’,...,b”), where b/ is the 
jth column of B, then we can express the product C=AB in the form 
C=(Ab', Ab’,..., Ab”), since the ith component of the vector Ab/ is 


n 
> p= UK OK: 


Example 1. Let 


3 | -!] 
A=|0 2 ] 
1 1 I 
and 
1 -1 0O 
B= 2 -1 Ii. 
—] 0 0 
Compute AB. 
Solution. 


34+2+1 —-3-1+0 0+1+0 


3 1 -! I -1 0 
0 2 ] 2 —-1 1/=/|0+4-1 0-2+0 04+2+0 
1 ] I)(-1 0 0 142-1 —-1-1+0 0+1+0 


6 -4 1 
=|3 -2 2 
2 -2 #1 
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Example 2. Let A and B be the matrices in Example 1. Compute BA. 


Solution. 
1 -1 O73 1 -!1 3+0+0 1-2+0 —1-—1+0 
2 -1 1)];0 2 1|= 6+0+1 2-2+1 —2-1+1 
—] 0 Oj;1 1 l —3+0+0 —1+0+0 1+0+0 
3 -1 -2 
= 7 1 -2 
—3 -1 ] 


Remark 1. As Examples | and 2 indicate, it is generally not true that AB= 
BA. It can be shown, though, that 


A(BC) = (AB)C (2) 
for any three n Xn matrices A, B, and C. We will give an extremely simple 
proof of (2) in the next section. 


Remark 2. Let I denote the diagonal matrix 


1 0 0 
0 1 0 
00 .. | 


I is called the identity matrix since IA=AI=A (see Exercise 5) for any 
nXn matrix A. 


The following two properties of determinants are extremely useful in 
many applications. 


Property 8. 
det AB = det A x det B. 


That is to say, the determinant of the product is the product of the de- 
terminants. 


Property 9. Let A(i|/) denote the (n—1)x(n—1) matrix obtained from A 
by deleting the ith row and jth column of A. For example, if 


l 2 6 | ) 
A=| 0 —-1 —3},. then A(2/3) = 4 5 ) 
-4 -5 0 a 


Let c, =(— 1)'*/ det A(i|/). Then 


n 
detA= > a;;C;; 


i=] 
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for any choice of 7 between 1 and n. This process of computing determi- 
nants is known as “expansion by the elements of columns,” and Prop- 
erty 9 states that it does not matter which column we choose to expand 


about. For example, let 


| 

9 
A= 

2 

—] 


Expanding about the first, second, third, and fourth columns of A, re- 


spectively, gives 


—-1 0 7 
detA=det} 1 3 4 
6 3 § 
3 2 
+2det] —1 0 
6 3 
9 0 
=-—3det| 2 3 
—| 3 
1 2 
—det} 9 QO 
—1 3 
9 -1 
=2det]| 2 ] 
— | 6 
l 3 
—3det}9 —-] 
2 l 
9 — 
=~—6det| 2 
—] 
l 
—4det| 9 
—|] 


3 


—] 


l 
6 


WW ON 


nA hn~IN 


3 2 6 
—9det}1 3 4 
6 3 § 
6 3 2 6 
7\|+det| —1 0O 7 
5 1 3 4 
7 1 2 6 
4|—det} 2 3 4 
5 —-1 3 § 
6 1 2 6 
7\+6det!9 OO 7 
5 2 3 4 
7 l 3 6 
4/+3det 9 -] 7 
5 — | 6 5 
6 
7 
4 
1 O 1 3 2 
1 3/)+7det} 2 1 
6 3 —1 6 
3 2 l 3 
—~1 O|]+Sdeti9 —}] 
6 3 2 l 


We will derive Properties 8 and 9 with the aid of the following lemma. 


Lemma 1. Let D= D(A) be a function that assigns to each n Xn matrix A 
a number D(A). Suppose, moreover, that D is a linear function of each 


3.6 Solutions of simultaneous linear equations 


column (row) of A separately, 1.e. 
D(al,...,a/+ cb/,...,a")=D(a',...,a/,...,a") + cD (a',...,b’,...,a”), 


and D(B)=-— D(A) if B is obtained from A by interchanging two col- 
umns (rows) of A. Then 


D(A) =det AX D (I). 


A function D that assigns to each n X n matrix a number is called alter- 
nating if D(B)= — D(A) whenever B is obtained from A by interchanging 
two columns (rows) of A. Lemma | shows that the properties of being 
alternating and linear in the columns (rows) of A serve almost completely 
to characterize the determinant function detA. More precisely, any func- 
tion D(A) which is alternating and linear in the columns (rows) of A must 
be a constant multiple of detA. If, in addition, D()=1, then D(A)=detA 
for all n <n matrices A. It also follows immediately from Lemma | that if 
D(A) is alternating and linear in the columns of A, then it is also alternat- 
ing and linear in the rows of A. 


PROOF OF LEMMA |. We first write A in the form A=(a!,a’,...,a”) where 


Qi Q2 Qin 

| ar 3 a4 Ary, 
a=| . |, a=| . |,...,a°= 

an} an | ann 


Then, writing a! in the form a'=a,,e'+... +a,,e” we see that 
D(A)=D(aj,e'+... +a,,e",a2,...,”) 
=a,,D(e',a’,...,a") +... +a,,D(e",a’,...,a”") 
= 3) 4,,D (e”,a°,...,a"). 
J 
Similarly, writing a? in the form a*=a,,e!+ ... +a,,e" we see that 
D(A)= > 4y,.4y,D (e”,e,a°,...,a"). 
jad 

Proceeding inductively, we see that 


D(A)= D> 4); ay,°** a, D(e,e”,...,e%). 


J loses, Jn 


Now, we need only sum over those integers /,,/,,...,/, with j,#j, since 
D(A) is zero if A has two equal columns. Moreover, 
D (e',e”,...,e") = D(el,e’,...,e")=¢,, D(D). 


Ginn teh 
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Consequently, 
D(A)= > Ej 41j,-++ Inj D (I)=detAx D(D. O 
Ji sere, J n 
We are now in a position to derive Properties 8 and 9. 
PROOF OF PROPERTY 8. Let A be a fixed nn matrix, and define the func- 
tion D(B) by the formula 
D (B) = det AB. 


Observe that D(B) is alternating and linear in the columns b’,...,b” of B. 
This follows immediately from the fact that the columns of AB are 
Ab!,...,Ab”. Hence, by Lemma 1. 


D (B) = det B x D (I) = det B x det AI=det A x det B. O 


PROOF OF PROPERTY 9. Pick any integer 7 between | and vn and let 
D(A)= >; A,iCijs 
i=! 
where c,,=(— 1)'*/ det A(i|/). It is trivial to verify that D is alternating and 
linear in the columns of A. Hence, by Lemma 1, 
D (A)=detA X D (I) =detA. 0 


The key to solving the system of equations (1) is the crucial observation 
that 


2 a,cy=O for kAj (3) 
where c, =(— 1)'t/ det A(i|/). The proof of (3) is very simple: Let B denote 
the matrix obtained from A by replacing the jth column of A by its kth col- 
umn, leaving everything else unchanged. For example, if 


3. 5 6 
A=! 4 2 1], j=2, and k=3, 
—-1 0 -] 
then 
3 6 6 
B=| 4 1 ] |. 
—-] -1 -] 


Now, the determinant of B is zero, since B has two equal columns. On the 
other hand, expanding about the jth column of B gives 


n n 
det B= S b C= Ss Ain C;j 
i=] i=] 


314 


3.6 Solutions of simultaneous linear equations 


where 
é,=(—1)'*’ det B(i|f) =(- 1)’ det A(i|/) = c;, 


Hence, >) ;_ ,4,¢ =0 if k is unequal to j. 

Now, whenever we see a sum from | to n involving the product of terms 
with two fixed indices j and k (as in (3)), we try and write it as the jk ele- 
ment of the product of two matrices. If we let C denote the matrix whose ij 
element is c,, and set 

adj A=C’, 
then 


n 


2 ai Gy = »> (adj A) ji ik (adj A x A) jk’ 


i=l 
Hence, from (3), 
(adj AXA), =0 for j#k. 
Combining this result with the identity 


n 
detA= >) a,c,;=(adj AXA) ;, 
i=] 


we See that 
detA 0 vee 0 
0 detA 
adj AX A= ; = (det ADI. (4) 
0 0 ... detA 


Similarly, by working with A’ instead of A (see Exercise 8) we see that 


detA 0 i 0 
0 detA 
AxXadjA=| . = (det AI. (5) 
0 0 ... detA 
Example 3. Let 
1 0 —-1 
A=/1 1 —-I}. 
1 2 | 


Compute adjA and verify directly the identities (4) and (5). 


Solution. 
T 
3 -—2 | 3 -—-2 1 
adjA=C7=| —2 2 —-2| =| -2 2 O 
l 
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so that 
3 -—-2 I]/1 0 -!1 2 0 O 
adjAX A=] —2 2 O11 1 -1/=/0 2 Of =2I 
1 —2 Ijll 2 1 0 0 2 
and 
1 0 -1 3 —-2 1 2 0 O 
AxadjA=/1 1 —1]] -2 2 O|=!|0 2 OO} =2I 
1 2 l 1 -—2 1 0 0 2 


Now, detA=1—2+1+2=2. Hence, 
adj AXA=AXadjA= (det A)I. 


We return now to the system of equations 


Qi eee a, 
Ax=b, A=|- » |, x=] oj, b=] : |. (6) 


If A were a nonzero number instead of a matrix, we would divide both 
sides of (6) by A to obtain that x=b/A. This expression, of course, does 
not make sense if A is a matrix. However, there is a way of deriving the 
solution x=b/A which does generalize to the case where A is annXn 
matrix. To wit, if the number A were unequal to zero, then we can multiply 
both sides of (6) by the number A7~! to obtain that 

A7'Ax=x=A7'b. 


Now, if we can define A~! as an nXn matrix when A is an nXn matrix, 
then the expression A~'b would make perfectly good sense. This leads us 
to ask the following two questions. 


Question 1: Given an n Xn matrix A, does there exist another n X n matrix, 
which we will call A~', with the property that 


A-'A=AA7'=1? 


Question 2: If A~! exists, is it unique? That is to say, can there exist two 
distinct matrices B and C with the property that 


BA=AB=I and CA=AC=lI? 


The answers to these two questions are supplied in Theorems 7 and 8. 


Theorem 7. An nXn matrix A has at most one inverse. 


PRooF. Suppose that A has two distinct inverses. Then, there exist two dis- 
tinct matrices B and C with the property that 


AB=BA=I and AC=CAe=I. 
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If we multiply both sides of the equation AC=I by B we obtain that 
BI= B(AC) = (BA)C=IC. 


Hence, B=C, which is a contradiction. ia 


Theorem 8. A7! exists if, and only if, detA+0, and in this case 


1.1 


PRooF. Suppose that detA+0. Then, we can divide both sides of the iden- 
tities (4) and (5) by detA to obtain that 

ad) A ant=—_axadiAcax 2uA 

detA”* detAs 9” deta 
Hence, A~!=adjA/detA. 

Conversely, suppose that A~! exists. Taking determinants of both sides 
of the equation A~'A=I, and using Property 8 gives 
(detA~ ')detA=detI=1. 


But this equation implies that det A cannot equal zero. Oo 


Finally, suppose that detA#0. Then, A! exists, and multiplying both 
sides of (6) by this matrix gives 


A” 'Ax=Ix=x=A7'b. 


Hence, if a solution exists, it must be A~ 'b. Moreover, this vector is a solu- 
tion of (6) since 


A(A~'b)=AA~'b=Ib=b. 
Thus, Equation (6) has a unique solution x=A~ 'b if detA<0. 


Example 4. Find all solutions of the equation 


] ] ] 0 x} 
2 -2 2|x=/0|,  x=/| 2}. (8) 
3 3 -3 0 x3 
Solution. 
l l l 1 l 
detl2 -—2 2|=detlQO —4 0 | =24. 
3 3 -3 0 0 —-6 


Hence, Equation (8) has a unique solution x. But 


317 


3 Systems of differential equations 


is obviously one solution. Therefore, 


is the unique solution of (8). 


Remark. It is often quite cumbersome and time consuming to compute the 
inverse of an nXn matrix A from (7). This is especially true for n>4. An 
alternate, and much more efficient way of computing A7~', is by means of 
“elementary row operations.” 


Definition. An elementary row operation on a matrix A is either 
(i) an interchange of two rows, 

(ii) the multiplication of one row by a nonzero number, 

or 

(iii) the addition of a multiple of one row to another row. 


It can be shown that every matrix A, with detA+0, can be transformed 
into the identity I by a systematic sequence of these operations. Moreover, 
if the same sequence of operations is then performed upon I, it is trans- 
formed into A~!. We illustrate this method with the following example. 


Example 5. Find the inverse of the matrix 


Solution. The matrix A can be transformed into I by the following 
sequence of elementary row operations. The result of each step appears be- 
low the operation performed. 


(a) We obtain zeros in the off-diagonal positions in the first column by 
subtracting the first row from both the second and third rows. 


1 0 -!I 
0 1 0 
0 2 2 


(b) We obtain zeros in the off-diagonal positions in the second column by 
adding (— 2) times the second row to the third row. 


1 O -l 
0 1 0 
0 O 2 
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(c) We obtain a one in the diagonal position in the third column by multi- 
plying the third row by 3. 


1 O -1 
0 1 0 
0 0 ] 


(d) Finally, we obtain zeros in the off-diagonal positions in the third col- 
umn by adding the third row to the first row. 


1 0 O 
0 1 O 
0 0 1 


If we perform the same sequence of elementary row operations upon I, we 
obtain the following sequence of matrices: 


1 0 0 1 0 0 l 0 0 
O 1 O},;-1 JI Of.) -1 1 O}, 
0 0 1 -1 O 1 1 -2 1 


1 0 O)| 3 -! 3 
a I ae —] l QO |. 
y ~Il 3 1-1 } 


The last of these matrices is A~!. 


EXERCISES 


In each of Problems 1-4, compute AB and BA for the given matrices A 


and B. 
3 2 0 —-1 0 9 
1 a-(i | n-( 2 1 -i] 2, A=(} i) B=(5 7 
2 6 O 0 6 2 
101 01 1 
3.A=/|1 1 O}, B=l1 1 #0 
01 1 101 
1111 11141 
_l1 11 1 _|2 2 2 2 
4. A= 1 11 41? B 3 3 3 3 
11411 4444 


5. Show that IA= AI=A for all matrices A. 


6. Show that any two diagonal matrices A and B commute, that is AB=BA, if A 
and B are diagonal matrices. 
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7. Suppose that AD=DA for all matrices A. Prove that D is a multiple of the 
identity matrix. 


8. Prove that AXadjA=detAxI. 
In each of Problems 9-14, find the inverse, if it exists, of the given matrix. 
—2 3 2 cos6 Q -—sin# 
9. 66 (OO 3 10. 0 l 0 
4 1 -1 sing 0 cos 6 


1 1 1 2 -!1 
We} -1l Gi i 12/3 1 2 
2 1 2 3 -!1 


1 il+i 1-i 0 1 1 
13. | 1 0 0 14. | -1 0 1 
0 l l —-1 -1 QO 
15. Let 
=(a a | 
42; 422 
Show that 
-1. 1 ( 42 ~412 
A eta ( — a9] ne 
if detA+0. 


16. Show that (AB)~'=B7!A~! if detA x detB+0. 


In each of Problems 17-20 show that x=0 is the unique solution of the 
given system of equations. 


17, xX;— X2—- x3=0 18. x, +2x,+4x,=0 
3X,— X,+2x,=0 Xy»+ x,=0 
2x; +2x,+3x,=0 Xy+ X 2+ x3=0 

19, x ;+ 2x,- x; =0 20. x,t 2x,- x3+3x,=0 
2X;+ 3x.+ xX3- xX4=0 2x,;+ 3x, — x4,=0 
—X, +2x3+2x,=0 —X,;+ X_+2x3+ x4=0 
3X,;-— Xot X34+3x,=0 — X_2+2x3+3x,=0 


3.7 Linear transformations 


In the previous section we approached the problem of solving the equation 


Ay, + Ay 
Ax=b, A=| : 7 |, x=/ i], b=|: (1) 


an) eee ann Xn b 


x} b, 


by asking whether the matrix A~' exists. This approach led us to the con- 
clusion that Equation (1) has a unique solution x=A~'b if detA¥0. In 
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order to determine what happens when detA=0O, we will approach the 
problem of solving (1) in an entirely different manner. To wit, we will look 
at the set V of all vectors obtained by multiplying every vector in R” by A 
and see if b is in this set. Obviously, Equation (1) has at least one solution 
x if, and only if, b is in V. We begin with the following simple but ex- 
tremely useful lemma. 


Lemma 1. Let A be an nXn matrix with elements a;; and let x be a vector 
with components X,,X>,...,X,. Let 


denote the jth column of A. Then 


Ax=x,al+x,a°+...+x,a". 
Proor. We will show that the vectors Ax and x,a'+...+.x,a" have the 
same components. To this end, observe that (Ax),, the jth component of 
AX, 18 a,,x,+...+4,,X,, while the jth component of the vector xal+...4+ 
x,a”" 1S 


_ _ 
Xa; t... +x, a7 = xa, +... + x,4, = (Ax). 


— ] 2 n 
Hence, Ax = x,a + x,a°+...+x,a". 0 


Now, let V be the set of vectors obtained by multiplying every vector x 
in R” by the matrix A. It follows immediately from Lemma | that V is the 
set of all linear combinations of the vectors a',a’,...,a”, that is, V is 
spanned by the columns of A. Hence the equation Ax =b has a solution if, 
and only if, b is a linear combination of the columns of A. With the aid of 
this observation, we can now prove the following theorem. 


Theorem 9. (a) The equation Ax=b has a unique solution if the columns of 
A are linearly independent. 
(b) The equation Ax=b has either no solution, or infinitely many solu- 
tions, if the columns of A are linearly dependent. 


PROOF. (a) Suppose that the columns of A are linearly independent. Then, 
a',a’,...,a” form a basis for R”. In particular, every vector b can be written 
as a linear combination of a’,a’,...,a”. Consequently, Equation (1) has at 
least one solution. To prove that (1) has exactly one solution, we show that 
any two solutions 

x] 
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and 


y=| 

Yn 

must be equal. To this end, observe that if x and y are two solutions of (1), 
then 


A(x — y)=Ax— Ay=b—b=0. 
By Lemma 1, therefore, 


(x;—y, al +(x —ya)a?t ... +(x, —y,)a" =0. (2) 
But this implies that x,=y,, x=) ,..., and x,=y,, since a',a’,...,a” are 
linearly independent. Consequently, x=y. 

(b) If the columns of A are linearly dependent, then we can extract from 
a!ja’,...,a" a smaller set of independent vectors which also span V (see Ex- 
ercise 12, Section 3.3). Consequently, the space V has dimension at most 
n—1. In other words, the space V is distinctly smaller than R”. Hence, 
there are vectors in R” which are not in V. If b is one of these vectors, then 
the equation Ax=b obviously has no solutions. On the other hand, if b is 
in V, that is, there exists at least one vector x* such that Ax*=b, then 
Equation (1) has infinitely many solutions. To prove this, observe first that 
x=x* is certainly one solution of (1). Second, observe that if A€=0 for 
some vector € in R”, then x=x*+€ is also a solution of (1), since 


A(x* +£)=Ax* + A£=b+0=b. 


Finally, observe that there exist numbers c,,c>,...,c, not all zero, such that 
c,a'+cja’+...+c,a"=0. By Lemma 1, therefore, AE=0, where 


Cj 


But if A&é equals zero, then A(agé) also equals zero, for any constant a. 
Thus, there are infinitely many vectors € with the property that A€=0. 
Consequently, the equation Ax=b has infinitely many solutions. oO 


Example 1. (a) For which vectors 


b, 
b= b, 
‘bs 
can we solve the equation 
1 1 2 
Ax=/|1 0 1{x=b? 
2 1 3 


322 


3.7. Linear transformations 


(b) Find all solutions of the equation 


1 1 2 ] 
1 O 1 |x=/|0}. 
2 1 3 l 


Solution. (a) The columns of the matrix A are 


] l 2 
al=|1 |, a7=|0/ and a’=| 1). 
2 ] 3 


Notice that a! and a? are linearly independent while a° is the sum of a’ and 
a’. Hence, we can solve the equation Ax=b if, and only if, 


| 1 Cy + C2 
b=c,/1}/+c,.}/0;=] “% 
2 | 2c, +c, 


for some constants c, and c,. Equivalently, (see Exercise 25) b,= 5, + by. 
(b) Consider the three equations 


X,+x,+2x,=0 
x, + x,=0 
2x, +X,+3x,=0. 


Notice that the third equation is the sum of the first two equations. Hence, 
we need only consider the first two equations. The second equation says 
that x, = — x3. Substituting this value of x, into the first equation gives 
X= — x3. Hence, all solutions of the equation Ax=0 are of the form 


(c) Observe first that 


is clearly one solution of this equation. Next, let x? be any other solution of 
this equation. It follows immediately that x?=x!'+é&, where € is a solution 
of the homogeneous equation Ax =0. Moreover, the sum of any solution of 
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the nonhomogeneous equation 


Ax= 


l 
0 
l 


with a solution of the homogeneous equation is again a solution of the 
nonhomogeneous equation. Hence, any solution of the equation 


} 
Ax=1|0 
l 
is of the form 
0 | 
x=/1/t+c| —1)}. 
0 l 


Theorem 9 is an extremely useful theorem since it establishes necessary 
and sufficient conditions for the equation Ax=b to have a unique solution. 
However, it is often very difficult to apply Theorem 9 since it is usually 
quite difficult to determine whether n vectors are linearly dependent or lin- 
early independent. Fortunately, we can relate the question of whether the 
columns of A are linearly dependent or linearly independent to the much 
simpler problem of determining whether the determinant of A is zero or 
nonzero. There are several different ways of accomplishing this. In this sec- 
tion, we will present a very elegant method which utilizes the important 
concept of a linear transformation. 


Definition. A linear transformation @ taking R” into R” is a function 
which assigns to each vector x in R” a new vector which we call @(x)= 


@(x,,...,x,). Moreover, this association obeys the following rules. 
@(cx) = c@(x) (i) 
and 
@(x+y) = @(x)+ @(y). (ii) 


Example 2. The transformation 
x, x 
x= —>@(x)= 
Xn Xn 
is obviously a linear transformation of R” into R” since 
@(cx) = cx =c@(x) 
and 


Q(x+y)=x+y=@(x)+ @(y). 
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Example 3. The transformation 


x, X,;+X,+X3 
X3 xy 


is a linear transformation taking R? into R? since 


CX, CX, + CX3 X, FX +X; 
@(cx) = | cx, + cx, — cx3 | =c¢| x; +X_—X3 | =c@(x) 
Cx, x) 
and 
(x, +1) + (x2 +2) + (43 +93) 
@(x+y)= (x, +y,) + (x2 +2) — (x3 +3) 
xX, +y, 
Xp FX + X3 YVityrzty3 
= |X, +x.—x3| +] Yi t+¥2—Y3 | =@(x) + Cy). 
Xx) J 


Example 4. Let @(x,,x,) be the point obtained from x =(x,, x,) by rotating 
x 30° in a counterclockwise direction. It is intuitively obvious that any 
rotation is a linear transformation. If the reader is not convinced of this, 


though, he can compute 
@(x1,X2) = 1 1 


and then verify directly that @ is a linear transformation taking R? into R. 


Example 5. The transformation 


takes R* into R? but is not linear, since 


l 


x? + 4x3 


@(2x) = @(2x,,2x,)= 4 #2@(x). 


Now, every nXn matrix A defines, in a very natural manner, a linear 
transformation @ taking R’—R”. To wit, consider the transformation of 
R”—R” defined by 


x—>@(x) = Ax. 
In Section 3.1, we showed that A(cx)=cAx and A(x+y)=Ax+ Ay. Hence, 
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the association x->@(x) = Ax is linear. Conversely, any linear transforma- 
tion @ taking R’—>R” must be of the form @(x)=Ax for some matrix A. 
This is the content of the following theorem. 


Theorem 10. Any linear transformation x—>@(x) taking R" into R" must be 
of the form @(x)= Ax. In other words, given any linear transformation @ 
taking R” into R", we can find an nXn matrix A such that 


@(x) = Ax 
for all x. 


Proor. Let e/ denote the vector whose jth component is one, and whose 
remaining components are zero, and let 


a/=@(e’). 
We claim that 
@(x)=Ax, A=(a',a’,...,a") (3) 
for every 
x| 
X=|. 
x 


in R”. To prove this, observe that any vector x can be written in the form 
x=x,e'+...+x,e”. Hence, by the linearity of @, 


@(x)=@(x,e'+... +x,e") =x, @(e!) +... +x,@(e") 
= x,a'+...+x,a"= Ax. O 


Remark 1. The simplest way of evaluating a linear transformation @ is to 
compute a’ = @(e!),...,a”= @(e”) and then observe from Theorem 10 and 
Lemma | that @(x)=Ax, where A=(a!,a’,...,a”). Thus, to evaluate the 
linear transformation in Example 4 above, we observe that under a rota- 
tion of 30° in a counterclockwise direction, the point (1,0) goes into the 
point (; V3 , 5) and the point (0, 1) goes into the point (—1,1 V3 ). Hence, 
any point x=(x,,x,) goes into the point 
7V3 —3 x) 7 V3 x1— 3X2 


1 1 1 1 ; 
3 +V3 X4 5x, +3V3 x, 


Remark 2. If @ and % are linear transformations taking R” into R", then 
the composition transformation @ ° ® defined by the relation 


@ ° B(x) = @(B(x)) 
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is again a linear transformation taking R”’—>R”. To prove this, observe that 
@ © B(cx) = @(B(cx)) = @(cB(x)) = c@(B(x)) 
=c@ o B(x) 
and 
@° B(x+y)=@(B(x+y)) =@(B(x) + B(y)) 
= @(B(x)) + @(B(y)) =@ ° B(x) + @ ° Bly). 


Moreover, it is a simple matter to show (see Exercise 15) that if @(x)=Ax 
and $ (x)= Bx, then 


@ o B(x) = ABx. (4) 


Similarly, if @, ®, and @ are 3 linear transformations taking R” into R’, 
with @(x)= Ax, ® (x)= Bx, and @(x)=Cx then 


(@ > B) ° C(x) =(AB)Cx (5) 
and 
@ © (B® © ©)(x) = A(BC)x. (6) 
Now, clearly, (@ ° B)° C(x)=@ °(B © ©)(x). Hence, 
(AB)Cx = A(BC)x 
for all vectors x in R”. This implies (see Exercise 14) that 
(AB)C= A(BC) 


for any three n Xn matrices A, B, and C. 


In most applications, it is usually desirable, and often absolutely essen- 
tial, that the inverse of a linear transformation exists. Heuristically, the in- 
verse of a transformation @ undoes the effect of @. That is to say, if @(x) 
=y, then the inverse transformation applied to y must yield x. More pre- 
cisely, we define @~ '(y) as the unique element x in R” for which @(x)=y. 
Of course, the transformation @~' may not exist. There may be some vec- 
tors y with the property that y# @(x) for all x in R”. Or, there may be 
some vectors y which come from more than one x, that is, @(x')=y and 
@(x*)=y. In both these cases, the transformation @ does not possess an 
inverse. In fact, it is clear that @ possesses an inverse, which we will call 
@~' if, and only if, the equation @(x)=y has a unique solution x for every 
y in R”. In addition, it is clear that if @ is a linear transformation and @“! 
exists, then @~' must also be linear. To prove this, observe first that 
@-'(cy)=c@~'(y) since @(cx)=cy if @(x)=y. Second, observe that 
@'iy'+y?)=@ '(y') + @ '(y?) since @(x'+x2)=y'+y? if @(x!)=y! and 
@(x’)=y*. Thus @~', if it exists, must be linear. 

At this point the reader should feel that there is an intimate connection 
between the linear transformation @~' and the matrix A~'. This is the 
content of the following lemma. 
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Lemma 2. Let A be an nXn matrix and let @ be the linear transformation 
defined by the equation @(x)=Ax. Then, @ has an inverse if, and only if, 
the matrix A has an inverse. Moreover, if A~' exists, then @~'(x)=A7'x. 


Proor. Suppose that @~' exists. Clearly, 
@° @'(x)=@-! > @(x) =x (7) 


and @7' is linear. Moreover, there exists a matrix B with the property that 
@~'(x)=Bx. Therefore, from (4) and (7) 


ABx = BAx = x 
for all x in R”. But this immediately implies (see Exercise 14) that 


AB=BA=I. 


Hence B=A_!, 
Conversely, suppose that A~! exists. Then, the equation 


@(x)=Ax=y 
has a unique solution x=A_ 'y for all y in R”. Thus, A~' also exists, and 


@-'(x)=A7'x. = 


We are now ready to relate the problem of determining whether the col- 
umns of an Xn matrix A are linearly dependent or linearly independent 
to the much simpler problem of determining whether the determinant of A 
is zero Or nonzero. 


Lemma 3. The columns of an nXn matrix A are linearly independent if, and 
only if, detA+0. 


Proor. We prove Lemma 3 by the following complex, but very clever 
argument. 


(1) The columns of A are linearly independent if, and only if, the equation 
Ax=b has a unique solution x for every b in R”. This statement is just 
a reformulation of Theorem 9. 

(2) From the remarks preceding Lemma 2, we conclude that the equation 
Ax=b has a unique solution x for every b if, and only if, the linear 
transformation @(x)= Ax has an inverse. 

(3) From Lemma 2, the linear transformation @ has an inverse if, and 
only if, the matrix A~! exists. 

(4) Finally, the matrix A~! exists if, and only if, detA0. This is the con- 
tent of Theorem 8, Section 3.6. Therefore, we conclude that the col- 
umns of A are linearly independent if, and only if, detA+0. O 


We summarize the results of this section by the following theorem. 
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Theorem 11. The equation Ax=b has a unique solution x=A™ 'b if detA# 
0. The equation Ax=b has either no solutions, or infinitely many solutions 
if detA=0. 


PRooF. Theorem 11 follows immediately from Theorem 9 and Lemma 3. 


CL 

Corollary. The equation Ax=0 has a nontrivial solution (that is, a solution 
x| 
x=|.- 
x 


with not all the x; equal to zero) if, and only if, detA=0. 


Proor. Observe that 


0 
is always one solution of the equation Ax=0. Hence, it is the only solution 


if det A#0. On the other hand, there exist infinitely many solutions if detA 
=(, and all but one of these are nontrivial. C 


Example 6. For which values of A does the equation 
1 A A 
1 O 1/x=0 
1 1 1 


have a nontrivial solution? 
Solution. 
r A A 
det}1 O 1 
1 1 1 
1 A A 
1 0 }e-0 
1 1 1 


has a nontrivial solution if, and only if, A=1. 


=\+\—1-A=A~1. 


Hence, the equation 


Remark 1. Everything we’ve said about the equation Ax =b applies equally 
well when the elements of A and the components of x and b are complex 
numbers. In this case, we interpret x and b as vectors in C”, and the matrix 
A as inducing a linear transformation of C” into itself. 


Remark 2. Suppose that we seek to determine n numbers x), x>,...,x,,. Our 
intuitive feeling is that we must be given n equations which are satisfied by 
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these unknowns. This is certainly the case if we are given n linear equa- 
tions of the form 


Aj Xt AyX_+...+4,X%,=5, j=1,2,...,n (9) 
and detA+0, where 


On the other hand, our intuition would seem to be wrong when detA=0. 
This is not the case, though. To wit, if detA=0, then the columns of A are 
linearly dependent. But then the columns of A’, which are the rows of A, 
are also linearly dependent, since det A? =detA. Consequently, one of the 
rows of A is a linear combination of the other rows. Now, this implies that 
the left-hand side of one of the equations (9), say the Ath equation, is a lin- 
ear combination of the other left-hand sides. Obviously, the equation Ax = 
b has no solution if 5, is not the exact same linear combination of 
b,,.-+5D,_ 1,0, 4 15---50,- For example, the system of equations 


X,;+x,+ x,=1 
2x, +2x,=3 


obviously has no solution. On the other hand, if b, is the same linear com- 
bination of b,,...,5,_1,b,41,---55,, then the Ath equation is redundant. In 
this case, therefore, we really have only n— 1 equations for the n unknowns 
X pp Xoy +245 Xp 
Remark 3. Once we introduce the concept of a linear transformation we 
no longer need to view an n Xn matrix as just a square array of numbers. 
Rather, we can now view an n Xn matrix A as inducing a linear transfor- 
mation @(x)=Ax on R’. The benefit of this approach is that we can derive 
properties of A by deriving the equivalent properties of the linear transfor- 
mation @. For example, we showed that (AB)C=A(BC) for any 3 nXn 
matrices A, B, and C by showing that the induced linear transformations 
@, B, and @ satisfy the relation (@ °B)°>C=@ °(® °@). Now, this re- 
sult can be proven directly, but it requires a great deal more work (see Ex- 
ercise 24). 


EXERCISES 


In each of Problems 1-3 find all vectors b for which the given system of 
equations has a solution. 


Ltdio6.)6oa lx=b 212 4 4 S)yo5 
48 8 12 
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11 4 8 
3. 1 —1 1)x=b 
3 2 2 


In each of Problems 4-9, find all solutions of the given system of equa- 
tions. 


1 2 -3 ] 2 1 3 
4 F 3 -i}=0 5. [-3 —2 (3 

1 3 10 6 8 2 4 

| | 2 -1 -1 ~] 
6 x=0 7. | —5 3 1 ]x= 2 

l 1 -1l l _] i -] 3 

l 1 1 —-] 

23s ane 
8. x=0 911 —-! 1 {x=] 1 

2 O 0 1 i 1-1 \ 

2 4 6 8 


In each of Problems 10-12, determine all values of A for which the given 
system of equations has a nontrivial solution. 


A 1 3 I o-oo A 
10. 1 A 3)x= 11. x=0 
-1 3 2 0 l 0 1 
—] 0 -1 1 
(: —] 4) 
12,12 —1 0 |Ix=0 
3 rn 5 


13. (a) For which value of A does the system of equations 


—2 -4 -§5 nN 
1 -1l —-1 /x=} 1 
4 2 3 3 
have a solution? 


(b) Find all solutions for this value of A. 


14. Suppose that Ax = Bx for all vectors 


x} 


Prove that A=B. 


15. Let @ and & be two linear transformations taking R” into R”. Then, there exist 
n Xn matrices A and B such that @(x) = Ax and % (x)= Bx. Show that @ ° 8 (x) 
= ABx. Hint: @ > 8 is a linear transformation taking R” into R”. Hence, there 
exists an n Xn matrix C such that @ > ®(x)=Cx. The jth column of C is 
@ ° B(e/’). Thus show that @ ° B(e/) is the jth column of the matrix AB. 
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16. 
17. 


18. 


19. 


20. 


21. 


22. 


332 


Systems of differential equations 


Let @ be a linear transformation taking R’-—R”. Show that @(0)=0. 


Let R(A) be the linear transformation which rotates each point in the plane by 
an angle @ in the counterclockwise direction. Show that 


B(a)(x)=( cose —sint)(~"), 


sin 8 cos@ 


Let R, and R, be the linear transformations which rotate each point in the 
plane by angles 0, and 0, respectively. Then the linear transformation ®, = 
GR, ° R, rotates each point in the plane by an angle 0, + 6, (in the counterclock- 
wise direction). Using Exercise (15), show that 


cos(6,+6,) —sin(@,+8,)\ (cos@, —sin@,\/cosé, —sin§, 
sin(@,+6,)  cos(@,+9,) sin?, cos@,/\ sin®, ~—cos@,) 
Thus, derive the trigonometric identities 
sin(@, + 8, )=sin@, cos6,+cos@, sin 8, 


cos(6, + 6, )=cos6, cos 4, — sin 6, sin 43. 
Let 
_ x + X2 


(a) Verify that @ is linear. 
(b) Show that every point (x,,x,) on the unit circle xf + x3=1 goes into a point 
on the circle, x? + x3 =2. 


Let V be the space of all polynomials p(t) of degree less than or equal to 3 and 
let (Dp)(t) = dp(t)/ dt. 

(a) Show that D is a linear transformation taking V into V. 

(b) Show that D does not possess an inverse. 


Let V be the space of all continuous functions f(t), — 00 <t< oo and let (Kf)(t) 
t 
=f S(s)as. 


(a) Show that K is a linear transformation taking V into V. 
(b) Show that (DK) f=f where Df=f’. 
(c) Let f(t) be differentiable. Show that 


[(KD ) f\O=f(O—-FO). 
A linear transformation @ is said to be 1—1 if @(x)#@(y) whenever x+y. In 


other words, no two vectors go into the same vector under @. Show that @ is 
1—1 if, and only if, @(x)= 0 implies that x=0. 


. A linear transformation @ is said to be onto if the equation @(x)=y has at least 


one solution for every y in R”. Prove that @ is onto if, and only if, @ is 1—1. 
Hint: Show first that @ is onto if, and only if, the vectors @(e’),...,@(e”) are 
linearly independent. Then, use Lemma | to show that we can find a nonzero 
solution of the equation @(x)=0 if @(e!),..., @(e”) are linearly dependent. Fi- 
nally, show that @(e'),...,@(e") are linearly dependent if the equation @(x) =0 
has a nonzero solution. 
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24. Prove directly that (AB)C=A(BC). Hint: Show that these matrices have the 
same elements. 


25. Show that 


if, and only if, b;= 5, + bd. 


3.8 The eigenvalue—-eigenvector method 
of finding solutions 


We return now to the first-order linear homogeneous differential equation 


x) Qi eee Qin 


x=Ax, x=|:|, A=|: Dd. (1) 


Our goal is to find n linearly independent solutions x!(f),...,x"(t). Now, 
recall that both the first-order and second-order linear homogeneous scalar 
equations have exponential functions as solutions. This suggests that we try 
x(t)= ev, where v is a constant vector, as a solution of (1). To this end, 
observe that 

f v=heMy 
and 


A(e™v) =e Ay. 
Hence, x(t) = ev is a solution of (1) if, and only if, \e’v =e’ Av. Dividing 
both sides of this equation by e”’ gives 
Av=Dv. (2) 
Thus, x(¢)= ev is a solution of (1) if, and only if, A and v satisfy (2). 


Definition. A nonzero vector v satisfying (2) 1s called an eigenvector of A 
with eigenvalue i. 


Remark. The vector v=0 is excluded because it is uninteresting. Obvi- 
ously, AO=A:0 for any number X. 


An eigenvector of a matrix A is a rather special vector: under the linear 
transformation x—>Ax, it goes into a multiple A of itself. Vectors which are 
transformed into multiples of themselves play an important role in many 
applications. To find such vectors, we rewrite Equation (2) in the form 


0= Av—Av=(A—AD)v. (3) 
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But, Equation (3) has a nonzero solution v only if det(A—AI)=0. Hence 
the eigenvalues A of A are the roots of the equation 


a\,—A a12 Gin 

a Ay—-N a. a>, 
O=det(A-Al)=det| — ane 

an} an Ann —A 


and the eigenvectors of A are then the nonzero solutions of the equations 
(A—AlDv=0, for these values of A. 

The determinant of the matrix A—AI is clearly a polynomial in A of de- 
gree n, with leading term (—1)"A”. It is customary to call this polynomial 
the characteristic polynomial of A and to denote it by p(A). For each root 
A; of p(A), that is, for each number A, such that p(A,)=0, there exists at least 
one nonzero vector w such that Av =\iW. Now, every polynomial of de- 
gree n > 1 has at least one (possibly complex) root. Therefore, every matrix 
has at least one eigenvalue, and consequently, at least one eigenvector. On 
the other hand, p(A) has at most n distinct roots. Therefore, every n Xn 
matrix has at most n eigenvalues. Finally, observe that every n Xn matrix 
has at most n linearly independent eigenvectors, since the space of all vec- 
tors 


has dimension n. 


Remark. Let v be an eigenvector of A with eigenvalue A. Observe that 
A(cv)=cAv= cAv=X(cv) 

for any constant c. Hence, any constant multiple (c 40) of an eigenvector 

of A is again an eigenvector of A, with the same eigenvalue. 


For each eigenvector w of A with eigenvalue A;, we have a solution x/ (f) 


=e'y of (1). If A has n linearly independent eigenvectors v',...,v” with 
oe ie Aj5---,A, respectively (A,,...,A, need not be distinct), then x/ (ft) 
= eh!) j=1,...,n are n linearly independent solutions of (1). This follows 


immediately from Theorem 6 of Section 3.4 and the fact that x/(0)=W. In 
this case, then, every solution x(f) of (1) is of the form 


x(t)=c,e*"v' + c,e"v? +... +0,e%"V". (4) 


This is sometimes called the “general solution” of (1). 
The situation is simplest when A has n distinct real eigenvalues 
Aj,Az,-+-5A, With eigenvectors v!,v’,...,v” respectively, for in this case we 
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are guaranteed that v',v’,...,v” are linearly independent. This is the con- 
tent of Theorem 12. 


Theorem 12. Any k eigenvectors v',...,v* of A with distinct eigenvalues 
Ajio+++9A, respectively, are linearly independent. 


PRooF. We will prove Theorem 12 by induction on k, the number of ei- 
genvectors. Observe that this theorem is certainly true for k=1. Next, we 
assume that Theorem 12 is true for k=/. That is to say, we assume that 
any set of 7 eigenvectors of A with distinct eigenvalues is linearly indepen- 
dent. We must show that any set of 7+1 eigenvectors of A with distinct ei- 
genvalues is also linearly independent. To this end, let v',...,v7*! be j+1 
eigenvectors of A with distinct eigenvalues A,,...,A;,, respectively. To de- 
termine whether these vectors are linearly dependent or linearly indepen- 
dent, we consider the equation 


ci +o +...+64,77'=0. (5) 
Applying A to both sides of (5) gives 
Ayeyv FAQ? +... FA; 4 164.1771 =0. (6) 


Thus, if we multiply both sides of (5) by A, and subtract the resulting equa- 
tion from (6), we obtain that 


(A, —A,)egv? + tA A A, )¢; C4107! =0. (7) 


But v*,...,v/*! are j eigenvectors of A with distinct eigenvalues ),...,A;4, 
respectively. By the induction hypothesis, they are linearly independent. 
Consequently, 


(A, — Aj en = 9, (A3—Aj)e3=0,..., and (Aj41—Ay)Gj41=9. 


Since A,,A),...,A;4; are distinct, we conclude that op C3,-++,C4, are all zero. 


Equation (5) now forces c, to be zero. Hence, v', v’, wt are linearly in- 
dependent. By induction, therefore, every set of k eigenvectors of A with 
distinct eigenvalues is linearly independent. 0 


Example 1. Find all solutions of the equation 


1 -Il 4 
K= | 3 2 —1'|x. 
2 1 -1 


Solution. The characteristic polynomial of the matrix 


1 —-] 4 
3 2 —-1 
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1S 
I-A -1 4 
3 2—-AX —1 
2 ] —1-A 
= —(1+A)(1—A)(2—A) +24 12—8(2—A) + (1 -A) —3(1 +A) 
=(1—A)(A—3)(A+2). 
Thus the eigenvalues of A are A, =1, A,=3, and A,= —2. 
(i) A, =1: We seek a nonzero vector v such that 


0 -1 4//°1 0 
amye|3 1 -1))e2|=[o} 
2 1 —2j[%3 0 


— v, +40, =0, 30, + v, — v, =9, and 2v,+ v,—20,=0. 


p(A) = det(A — AI) = det 


This implies that 


Solving for v, and v, in terms of v, from the first two equations gives v, = 
—v, and v,=40v,. Hence, each vector 


—] 
v=c 4 
l 
is an eigenvector of A with eigenvalue one. Consequently, 
—] 
ce'| 4 
1 
is a solution of the differential equation for any constant c. For simplicity, 


we take 
—] 
one ‘| 
1 


(ii) A, =3: We seek a nonzero vector v such that 


—2 -1 4)|% 0 
(Aap | 5 -i}||-[9} 
2 1 -4 


This implies that 


Solving for v, and v, in terms of v, from the first two equations gives v, = 
v, and v,=2v;. Consequently, each vector 
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is an eigenvector of A with eigenvalue 3. Therefore, 


x*(t) =e?! A 


l 
is a second solution of the differential equation. 
(iii) A, = —2: We seek a nonzero vector v such that 
3 -1 4)1°% 0 
(A+2Iv=/3 4 -1]/%]/=/0]. 
2 ] 1} (%3 0 


This implies that 
30, — 0» +40, =0, 30, +4v,—0;=0 and 2v,+0,+0,=0. 


Solving for v, and v, in terms of v, gives v, = — v; and v,=v,. Hence, each 


vector 
—] 
Vv=C l 
l 


is an eigenvector of A with eigenvalue —2. Consequently, 


—1 
x?( t) _ e 2! | 1 
l 
is a third solution of the differential equation. These solutions must be lin- 
early independent, since A has distinct eigenvalues. Therefore, every solu- 
tion x(t) must be of the form 


—] l —1 
x(t)=cyje'| 4] +c,e*}2 vee i 
It ] 

—ce'+ coe —ce~* 


4c,e'+2c,e%+c,e 7]. 


t 3t —2t 
cje + coe +03e 


Remark. If A is an eigenvalue of A, then the n equations 


ajj0, + eee +(a,—A)v,+ .. +4,,0, =0, J=1,...,1 


are not independent; at least one of them is a linear combination of the 
others. Consequently, we have at most n— 1 independent equations for the 
n unknowns 0),...,0,. This implies that at least one of the unknowns 
U},---,v, can be chosen arbitrarily. 
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Example 2. Solve the initial-value problem 


-(} Bs x0-(2) 


Solution. The characteristic polynomial of the matrix 


-{1 12 
a=(5"t) 


p(r)=der( 17% 12 )=(1-A)?-36=(A-7)(A+5). 


1S 


3 1—-A 


Thus, the eigenvalues of A are A, =7 and A,= —5. 
(i) 4, =7: We seek a nonzero vector v such that 


acniva(“$ 2) 2)4(8) 


This implies that v, =2v,. Consequently, every vector 


_ *) 

. e( 

is an eigenvector of A with eigenvalue 7. Therefore, 
x'(t)= en (? 


is a Solution of the differential equation. 
(ii) A, = —5: We seek a nonzero vector v such that 


(s e)-t8 
+ 5I)v= = . 
(A+ Dy (5 6)\2)" \o 
This implies that v, = —2v,. Consequently, 
(1) 
1 
is an eigenvector of A with eigenvalue —5, and 
(=e * = 
is a second solution of the differential equation. These solutions are lin- 


early independent since A has distinct eigenvalues. Hence, x(t)=c,x'(t)+ 
c,x°(t). The constants c, and c, are determined from the initial condition 


(°}= eix!(0) +epx%(0)=( 761 +( ~*), 
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Thus, 2c, —2c,=0 and c,+c,=1. The solution of these two equations is 
c,=4 and c,=+. Consequently, 


; ; git oW5t 
x= ze"(T)+ 50H i) | ony 1 e-s 
2 2 


Example 3. Find all solutions of the equation 


l 
2 
x=Ax= | 3 
4 


WM BR W dN = 

A Bm WN — 

OA Bm WN — 

OM &WN — 
ras 


5 


Solution. It is not necessary to compute the characteristic polynomial of A 
in order to find the eigenvalues and eigenvectors of A. To wit, observe that 


l 
2 
AX=A| 3 | =(x, + x, +%3+%4+X5)] 3 |. 
4 
5 


Hence, any vector x whose components add up to zero is an eigenvector of 
A with eigenvalue 0. In particular 


] 0 0 0 

0 l 0 0 

v=! ol], v=| ol, w=! 1], and v=| Oo 
0 0 0 l 

—]| —] —] —] 


are four independent eigenvectors of A with eigenvalue zero. Moreover, 
observe that 
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The five vectors v', v’, v’, v*, and v° are easily seen to be linearly indepen- 
dent. Hence, every solution x(t) is of the form 


l 0 0 0 l 
0 l 0 0 2 
x(th=c,| Olt+e,| O]+e3;| L}+eg} 0] tese'* | 3 
0 0 0 l 4 
—1 —1 —] —] 5 
EXERCISES 
In each of Problems 1-6 find all solutions of the given differential equa- 
tion. 
._(6 —3 _(-2 1 
1 i=(6 x 2. X (~3 3) 
3 2 4 7 —1 6 
3.x=|2 0 2 |x 4.x=|-10 4 —-12 |x 
4 2 3 —2 ] —1 
1 2 3 £6 
-7 0 6 
sa-(70 § o)s aa-]} § 2 Bh 
602 7 14 21 4 
In each of Problems 7-12, solve the given initial-value problem. 
7. x=(1 x, x =(3} 8. x=(_5 “3 )x, x()=(9} 


3 1 -!1 l 
f 3 -1)s xo-( -2] 
3 3 -!1 —] 
1 -1 O —] 
10. (1 2 fs xo-( -4] 
1 10 2 13 


1 -3 2 —2 
11.x=[0 —-1 0 |x, x(0)= 0 
0 -1 -2 3 


3 1 -2 I 
12, x=[-1 2 1}x, x@)=] 4 
4 1 -3 —7 


13. (a) Show that e*¢~y, ¢, constant, is a solution of x=Ax if AV=Av. 
(b) Solve the initial-value problem 


3 1 -2 l 
x={-—-1 2 L}x, x(l)=] 4 
4 1 -3 —7 
(see Exercise 12). 
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14. Three solutions of the equation x = Ax are 


e! + e2! e!+e! e'—e! 
et |, ef and — e3! 
0 er! _ er! 


Find the eigenvalues and eigenvectors of A. 

15. Show that the eigenvalues of A~! are the reciprocals of the eigenvalues of A. 
16. Show that the eigenvalues of A” are the nth power of the eigenvalues of A. 
17. Show that A\=0 is an eigenvalue of A if detA=0. 


18. Show, by example, that the eigenvalues of A+B are not necessarily the sum of 
an eigenvalue of A and an eigenvalue of B. 


19. Show, by example, that the eigenvalues of AB are not necessarily the product 
of an eigenvalue of A with an eigenvalue of B. 


20. Show that the matrices A and T~'AT have the same characteristic polynomial. 


21. Suppose that either B~! or A~! exists. Prove that AB and BA have the same 
eigenvalues. Hint: Use Exercise 20. (This result is true even if neither B~! or 
A! exist; however, it is more difficult to prove then.) 


3.9 Complex roots 


If \=a+if is a complex eigenvalue of A with eigenvector v=y' + iv’, then 
x(t)= ey is a complex-valued solution of the differential equation 


k= Ax. (1) 
This complex-valued solution gives rise to two real-valued solutions, as we 
now show. 
Lemma 1. Let x(t)=y(t)+ iz(t) be a complex-valued solution of (1). Then, 
both y(t) and z(t) are real-valued solutions of (1). 
ProorF. If x(t) = y(t) + iz(t) is a complex-valued solution of (1), then 
y(t) + iz(t) = A(y(t) + iz(t)) = Ay(t) + (Az(¢). (2) 


Equating real and imaginary parts of (2) gives y(t)=Ay(t) and z(t)=Az(?). 
Consequently, both y(t)=Re{x(t)} and z(¢)=Im{x(t)} are real-valued 
solutions of (1). 0 


The complex-valued function x(t) = e@* ‘8 (y! + iv?) can be written in 
the form 
x(t) =e (cos Br + isin Bt)(v' + iv’) 
=e (v' cos Bt —v’sin Bt) + i(v' sin Br + v’ cos Br) |. 
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Hence, if \=a+t if is an eigenvalue of A with eigenvector v=v! + iv’, then 
y(t) =e (v' cos Bt —v’ sin Br) 

and 
z(t) =e™ (v' sin Bt + v’ cos Bt) 

are two real-valued solutions of (1). Moreover, these two solutions must be 


linearly independent (see Exercise 10). 


Example 1. Solve the initial-value problem 


gy ok orl 
KX=/0 1 —-1 |x, x(O)=/{ 1]. 
I 


0 1 l 
Solution. The characteristic polynomial of the matrix 
1 O 0 
A=/0 1 -1 
0 1 1] 


iS 


1-A O 0 
p(A)=det(A—AI)=det|] 0 I-A «6-1 
0 I 1—A 


=(1-A)?+(1—A)=(1—A)(A?2— 2A +2). 


Hence the eigenvalues of A are 


A=1 and A= 


+i. 


2+V4-8 _ 
2 


(i) A=1: Clearly, 


is an eigenvector of A with eigenvalue 1. Hence 


] 
x'(t)=e! F 
0 


is one solution of the differential equation x= Ax. 
(ii) \=1+i7: We seek a nonzero vector v such that 


ian civonly=| > -1)}2|=[9} 


0 1 —Tj[% 
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This implies that — iv, =0, —iv,—v,=0, and v,—iv,=0. The first equa- 
tion says that v, =0 and the second and third equations both say that v= 
iv,. Consequently, each vector 
; 
V=Cl i 


I 


is an eigenvector of A with eigenvalue 1+i. Thus, 


x(t)= e(iti: j 


is a complex-valued solution of the differential equation x= Ax. Now, 


_ {0 0 0 
ett) 7 | =e’ (costt+isin£)| |O} +i] 1 
] 0 
0 0 0 0 
=e'lcost!0|—sin¢|1]|+isint!|0|+icost¢| | 
1 0 | 0 
0 0 
=e'| —sint | +ie’| cost}. 
cost sin t 


Consequently, by Lemma 1, 


x*(t) =e! 


0 0 
—sint and x°(t)=e'| cost 
cost sin t 


are real-valued solutions. The three solutions x!(f), x°(t), and x°(f) are lin- 
early independent since their initial values 


] 0 0 
x'(0)= 0} x?(0) = | 1 and x(0)= ‘| 
0 0 l 


are linearly independent vectors in R®. Therefore, the solution x(t) of our 
initial-value problem must have the form 


l 0 0 
x(t)=c,e'|0|+c,e’| —sint | +c3;e’| cost]. 
0 cost sin t 
Setting t=0, we see that 
| l 0 0 C 
1 =C 0 + Cy 0 + C3 1|= C3 . 
] 0 ] 0 C2 
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Consequently c,;=c,=c,=1 and 


] 0 0 
x(t)=e’ q +e! | —sint | +e’ cn 
0 cost sint 
l 
=e'| cost—sint |. 
cost+sint 


Remark. If v is an eigenvector of A with eigenvalue A, then v, the complex 
conjugate of v, is an eigenvector of A with eigenvalue A. (Each component 
of v is the complex conjugate of the corresponding component of v.) To 
prove this, we take complex conjugates of both sides of the equation Av= 
Av and observe that the complex conjugate of the vector Av is Av if A is 
real. Hence, AV=Av, which shows that v is an eigenvector of A with eigen- 
value X. 


EXERCISES 


In each of Problems 1—4 find the general solution of the given system of 
differential equations. 


1 -5 0 
1 x=(73 _i)x 2 a(1 —3 0) 


0 0 1 
1 O 0 1 0 ] 
3.x=/3 1 —-2)x 4.x=]| 0 1 -1)x 
2 2 l —2 0 -l 
In each of Problems 5-8, solve the given initial-value problem. 
_— l —] — l v= 3 —2 = I 
> (; 73) *®) (3) 6. x= (j ~{)s xO (5) 
—3 0 2 0 
7.%={ 1 -1 O]}x, x)=[-1 
-2 -l1 0 —2 
0 2 0 0 l 
-.|-2 0 0 0 _|1 
8. x 00 0-3” x(0) I 
0 0 3 O 0 


9. Determine all vectors x° such that the solution of the initial-value problem 
] 0 -2 
K=10 1 Ofx, x(0)=x° 
1 -1l -!1 


is a periodic function of time. 
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10. Let x(*)= ev be a solution of x=Ax. Prove that y(t)=Re{x(t)} and z2(t)= 
Im{x(¢)} are linearly independent. Hint: Observe that v and ¥ are linearly inde- 
pendent in C” since they are eigenvectors of A with distinct eigenvalues. 


3.10 Equal roots 


If the characteristic polynomial of A does not have n distinct roots, then A 
may not have n linearly independent eigenvectors. For example, the matrix 


1 1 O 
0 1 O 


0 0 2 


has only two distinct eigenvalues A, =1 and A,=2 and two linearly inde- 
pendent eigenvectors, which we take to be 


3) = i 


Consequently, the differential equation x= Ax has only two linearly inde- 


pendent solutions 
l 0 
e'lQ| and e|Q 
0 l 


of the form e*’v. Our problem, in this case, is to find a third linearly inde- 
pendent solution. More generally, suppose that the n x n matrix A has only 
k <n linearly independent eigenvectors. Then, the differential equation 
x = Ax has only k linearly independent solutions of the form ev. Our 
problem is to find an additional n—k linearly independent solutions. 

We approach this problem in the following ingenious manner. Recall 
that x(#)=e%c is a solution of the scalar differential equation x = ax, for 
every constant c. Analogously, we would like to say that x(t)=e4‘v is a 
solution of the vector differential equation 


k= Ax (1) 


for every constant vector v. However, e4’ is not defined if A is annXn 

matrix. This is not a serious difficulty, though. There is a very natural way 

of defining e“’ so that it resembles the scalar exponential e; simply set 
A’? A"t” 


At — 
e =I+At+ t+... + nl + .... (2) 


It can be shown that the infinite series (2) converges for all t, and can be 


345 


3 Systems of differential equations 


differentiated term by term. In particular 


n+1 
4 MAT AY4 at AN "4 _ 
dt ni 


=A/I+Ar+ tA .. |=AeM 


This implies that e4’v is a solution of (1) for every constant vector v, since 


da Aty — A pAly = At 
ae Ae“y=A(e”'v). 


Remark. The matrix exponential e“‘ and the scalar exponential e™ satisfy 
many similar properties. For example, 


(eA) = e Af and eAlt+s) — eAtoAs. (3) 


Indeed, the same proofs which show that (e“)~'=e7% and e*“*+9 = e%e® 
can be used to establish the identities (3): we need only replace every a by 
A and every | by I. However, e4‘*®! equals e4“e®’ only if AB=BA (see Ex- 
ercise 15, Section 3.11). 


There are several classes of matrices A (see Problems 9-11) for which 
the infinite series (2) can be summed exactly. In general, though, it does 
not seem possible to express e“’ in closed form. Yet, the remarkable fact is 
that we can always find n linearly independent vectors v for which the in- 
finite series e“’vy can be summed exactly. Moreover, once we know ni lin- 
early independent solutions of (1), we can even compute e*’ exactly. (This 
latter property will be proven in the next section.) 

We now show how to find n linearly independent vectors v for which 
the infinite series e“’y can be summed exactly. Observe that 


A (A-ADt eli 


e“v=e v 
for any constant A, since (A—ADQAD =(AI(A—Al). Moreover, 
272,2 
eAlty = [I+ale+ A mi +... | 


Hence, eA’v = e“eA~ADly, 

Next, we make the crucial observation that if v satisfies (A—AD”v=0 
for some integer m, then the infinite series e4~*”’v terminates after m 
terms. If (A—Al)"v=0, then (A—AI)”*'y is also zero, for every positive in- 
teger /, since 


(A—AI)”*'v=(A—A) [(A—AD)"v] =0. 
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Consequently, 
eA-Dty =v + t(A—Alv+ +o sy (A—AI)”~'v 
and 
oAty = eMp(A—ADty 
=e“lv+t(A—Alvt+...+ Cn =r —~——(A-AI)”"'v 


This suggests the following algorithm for finding n linearly independent 
solutions of (1). 

(1) Find all the eigenvalues and eigenvectors of A. If A has vn linearly in- 
dependent eigenvectors, then the differential equation x = Ax has n linearly 
independent solutions of the form ey. (Observe that the infinite series 
e‘A-ADty terminates after one term if v is an eigenvector of A with eigen- 
value A.) 

(2) Suppose that A has only k <n linearly independent eigenvectors. 
Then, we have only k linearly independent solutions of the form ev. To 
find additional solutions we pick an eigenvalue A of A and find all vectors 
v for which (A—AI)’v=0, but (A—ADv<0. For each such vector v 


eAty = eMe(A- My = eM y+ ¢(A—AD)v | 


is an additional solution of x= Ax. We do this for all the eigenvalues X of 
A. 

(3) If we still do not have enough solutions, then we find all vectors v 
for which (A—AI)*v=0, but (A—AI)*v+0. For each such vector v, 


eAty = el y+ 1(A—Al)v+ © (A-M)v 


is an additional solution of x= Ax. 

(4) We keep proceeding in this manner until, hopefully, we obtain n lin- 
early independent solutions. 

The following lemma from linear algebra, which we accept without 
proof, guarantees that this algorithm always works. Moreover, it puts an 
upper bound on the number of steps we have to perform in this algorithm. 


Lemma 1. Let the characteristic polynomial of A have k distinct roots 
Aj,---A, with multiplicities nj,,...,n, respectively. (This means that p(A) 
can be factored into the form (A, —A)"...(A, —A)*.) Suppose that A has 
only v;<n, linearly independent eigenvectors with eigenvalue A;. Then the 
equation (A- AI)’ =0 has at least v,+1 independent solutions. More 
generally, if the equation (A—A, "v= 0 has only m,<n, independent 
solutions, then the equation (A— -A, I)”*'v=0 has at least m,+1 indepen- 
dent solutions. 
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Lemma | clearly implies that there exists an integer d, with d,<n,, such 
that the equation (A— —AD4v= 0 has at least n, linearly independent solu- 
tions. Thus, for each eigenvalue A, of A, we can compute n, linearly inde- 
pendent solutions of x= Ax. All these solutions have the form 
(A-AI)*'v ‘| 


x(t)=e\1 v+t(A-ADv+... 


‘Ge 7 


In addition, it can be shown that the set of n,+...+n,=n solutions thus 
obtained must be linearly independent. 


Example 1. Find three linearly independent solutions of the differential 


equation 
1 1 0 
x=/0 1 O|x. 
0 0 2 
Solution. The characteristic polynomial of the matrix 
1 1 0 
A=/0 1 0 
0 0 2 


is (1—A)*(2—A). Hence A\=1 is an eigenvalue of A with multiplicity two, 
and A=2 is an eigenvalue of A with multiplicity one. 
(i) \=1: We seek all nonzero vectors v such that’ 


O 1 O};% 0 
0 O Of} ¥2| =| 0]. 
0 0 I)[% 0 
This implies that v, =v, =0, and v, is arbitrary. Consequently, 


x'(t)=e' q 
0 


is one solution of x=Ax. Since A has only one linearly independent eigen- 
vector with eigenvalue 1, we look for all solutions of the equation 


; 0 1 O|/O 1 O 0 0 OO}; % 0 
(A-I)/v=/0 0 O}/0 O O|v=/0 O Of] %)=]0 
0 0 1j(0 O 1 0 0 1){% 0 


This implies that v,=0 and both v, and v, are arbitrary. Now, the vector 


(A-Dv= 
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satisfies (A—I)’v=0, but (A—I)v+0. (We could just as well choose any 


v7 
“i 
0 
for which v, #0.) Hence, 


0 0 
| = e'elA-Ds l 
0 0 


0 
=e'1e(a-0)})1] =e! 
0 


x(t) =e” 


0 
I 
0 


0 ] | 
=e'!l1]}+7e'|0|=e'! 1 
0 0 0| 


is a second linearly independent solution. 
(ii) A=2: We seek a nonzero vector v such that 


—] 1 O}]% 0 
(A—21Dv= 0 -1 O}] |=] 0}. 
0 0 0j)[% 0 


This implies that v, = v,=0 and v, is arbitrary. Hence 


x?(t) =e”! q 
l 


is a third linearly independent solution. 


Example 2. Solve the initial-value problem 


F l 1 H 
Xx=!10 2 —I1Ix, x(0)=/] 2}. 
0 0 2 l 


Solution. The characteristic polynomial of the matrix 


is (2—A)>. Hence A=2 is an eigenvalue of A with multiplicity three. The ei- 
genvectors of A satisfy the equation 


0 1 31) 01 0 
(A—21I)v=|0 O —-I1]|%}]=) 0}. 
0 O 0} | 3 0 
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This implies that v,=v,=0 and 0, is arbitrary. Hence 


l 
x'(t) =e”! q 
0 


is one solution of x= Ax. 
Since A has only one linearly independent eigenvector we look for all 


solutions of the equation 
v7 0 
02; =/0]. 
03 0 


; 0 | 3/|}0 1 3 0 0 -1 
(A-21)'v=|0 0 -1//0 0 —-I1/v=!0 0 0 
This implies that v,=0 and both v, and v, are arbitrary. Now, the vector 


0 0 0)10 O Oj}; (0 0 O 


satisfies (A — 2I)’v=0, but (A—21v+0. Hence 


0 0 
x(t) = eM 1|= e2te(A—2Di 1 
0 0 


0 
-(n+4a-2p)]}=< I+1 
0 


0 1 3] || 0 
0 0 -!1 l 
0 0 0) 1 0 


0 l t 
=e) 1) +¢/0] [=e 1 1 
0 0 0 


is a second solution of x= Ax. 
Since the equation (A —2I)*v=0 has only two linearly independent solu- 
tions, we look for all solutions of the equation 
0 
“Eh 
0 


3 

; 0 1 3 0 0 OO}: 

(A-21I)v=/0 0 —-1/]v=/0 O O]]% 
Obviously, every vector v is a solution of this equation. The vector 


0 0 0 0 0 O){% 
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does not satisfy (A —21)*v=0. Hence 


0 
x?(£) = e™! 0 
l 


0 
= gig(A-Wt! 4 
l 


2 0 
=e*|1+1(A-21)+ y(A-21) | q 
1 


0 3 >{-1 3r—50? 
=e 10}+t)-1]/+>] Of; ]=et| _, 
| 0 0 1 


is a third linearly independent solution. Therefore, 


1 t 3t—42? 
x(t)=e~1 c,|O}+c,| 1] +0, _ 
0 0 1 


The constants c,, c,, and c, are determined from the initial conditions 


] l 0 0 
2) =c,|0} +c.) 1] +0310}. 
l 0 0 l 


This implies that c,=1, c,=2, and c;=1. Hence 


1+5t-30 
x(th=e"} 4_, 
J 
For the matrix A in Example 2, p(A)=(2—A)> and (2I— A)’ =0. This is 
not an accident. Every matrix A satisfies its own characteristic equation. 
This is the content of the following theorem. 


Theorem 13 (Cayley-Hamilton). Let p(A)=pot+p,A+... +(—1)"A” be the 
characteristic polynomial of A. Then, 


p(A)=po+p,At+...+(—1)'A"=0. 


FAKE PROOF. Setting A=A in the equation p(A)=det(A—AlI) gives p(A)= 
det(A — AI) =det0=0. The fallacy in this proof is that we cannot set \=A 
in the expression det(A —AI) since we cannot subtract a matrix from the di- 
agonal elements of A. However, there is a very clever way to make this 
proof kosher. Let C(A) be the classical adjoint (see Section 3.6) of the 
matrix (A—AI). Then, 


(A—ADC(A)=p(A)L. (4) 
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Each element of the matrix C(A) is a polynomial in A of degree at most 
(n— 1). Therefore, we can write C(A) in the form 


C(A)=C,+CAt...+C,_ A"! 


where G,,...,C,_, are n Xn matrices. For example, 
A+A?—s- 2A -(0 o)+(? 2) 4 1? 0 
2? 1-A} \O 17 VO -A? Nd? 0 
_(0 °) (’ 2) (1 ° 
(ST) +A ~1)**\1 of 
Thus, Equation (4) can be written in the form 


(A—AD[C,+CA+...+C,_ A"! ]=poltp Alt ...+(-1)'A"L (5) 


Observe that both sides of (5) are polynomials in A, whose coefficients are 
nXn matrices. Since these two polynomials are equal for all values of A, 
their coefficients must agree. But if the coefficients of like powers of A 
agree, then we can put in anything we want for A and still have equality. In 
particular, set A=A. Then, 


p(A)=p.l+p,At...+(—1)"A" 
=(A—AD[C)+C,At+...+C,_,A"~']=0. Oo 


EXERCISES 


In each of Problems 1—4 find the general solution of the given system of 
differential equations. 


0 -!1 l 1 1 | 
1x=|2 -—3 1 |x 2. x= > 1 -1'Ix 
1 -1 -1 —3 2 4 


-l -l 0 
3. -| 0 -1 os Hint: Look at Example 1 of text. 


f Problems 5-8, solve the given initial-value problem 


—-1 1 2 1 —-4 -4 0 2 
5.x=|-1 1 1]x, xO =1{0 6 x={ 10 9 Ix, xO=] 1 
—2 1 3 1 -~4 -3 1 —] 


3 0 0 Q l 
Ll 2 -3 1300 i 
7,x=( 1 1 2 |x, x(0)=/0] 8 x= x, x(0)= 
| op 0 0 0 3 0 
0 0 2 3 l 


9. Let 
A, O 0 
0 A 0 
A= ; 
0 0 hy 
Show that 
et 0 
oat 0 et 9g 
0 0 era! 
10. Let 
A 1 0 
A=!10 A 1 
0 0A 
Prove that 
vii! st? 
At__ ,Mf 
ew=e"lo 1 4 
0 0 1 


11. 


Hint: Write A in the form 


and observe that 


Let A be the n Xn matrix 


nN | 0 
0 A l 0 
0 0 0 
0 oO O A 
and let P be the n Xn matrix 
0 l 0 
0 l 0 


0 


io) 

Com) S eo @ e 
© 

S 


Equal roots 
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(a) Show that P”=0. (b) Show that AIDP=PAI). 
(c) Show that 


2p2 n—-1 
eA! = eM 1+ P+ OF + ppl . 
2! (n—1)! 


12. Compute e™ if 


210 0 
_lo0 21 0 
@A=)9 9 2 1 

0002 

210 0 210 0 
_10 21.0 _|0 200 
)A=19 9 2 Of (A='y 9 2 0 

0002 0002 


13. (a) Show that eT '4T=T~-'e4T. 


(b) Given that 
1 0 0 
T 'AT=|0 2 0 


0 0 -!i 


1 -1 -1l 
T=|3 -1 2], 
2 2 3 


14. Suppose that p(A)=det(A—AI) has n distinct roots A,,...,A,. Prove directly that 
p(A)=(— 1)"(A—A,D...(A—A,D =0. Hint: Write any vector x in the form x= 
x;v'+... +x,v" where v!,...,v" are n independent eigenvectors of A with eigen- 
values A,,...,A, respectively, and conclude that p(A)x=0 for all vectors x. 

15. Suppose that A?= aA. Find e“’. 


16. Let 


with 


compute e4’. 


> 

l 
| lll eel oll el omen 
| ell coe ll eon meee 
| tll cee coe ce 
pew peepee eh pee 
omet femeh  peeh peek eh 


(a) Show that A(A—5I)=0. 
(b) Find e4¢. 


17. Let 


(a) Show that A?7=—I. 
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(b) Show that 


eM = cost sin’) 
—sint cost) 


In each of Problems 18-20 verify directly the Cayley-Hamilton Theorem 
for the given matrix A. 


1 -1 -Il 2 3 1 -1 0 1 
18. A=| 2 1 -1] 19. A=[2 3 1] 20. A=[-1 3 0O 
-1 -!l ] 2 3 1 2 4 6 


3.11 Fundamental matrix solutions; e”’ 


If x'(t),...,x"(t) are n linearly independent solutions of the differential 
equation 
x = Ax, (1) 
then every solution x(t) can be written in the form 
x(t)=c,x'(t)+c)x?(t)+ ... +0,x"(t). (2) 


Let X(t) be the matrix whose columns are x'(t),...,x"(t). Then, Equation 
(2) can be written in the concise form x(t)= X(t)c, where 


Definition. A matrix X(f) is called a fundamental matrix solution of (1) if its 
columns form a set of n linearly independent solutions of (1). 


Example 1. Finda fundamental matrix solution of the system of differen- 
tial equations 
1 -1 4 
x= 3 2 -1 bs (3) 


2 1 —-] 
Solution. We showed in Section 3.8 (see Example 1) that 


—] ] —] 
et] Al, e!12| and e~*| 4 
l l ] 


are three linearly independent solutions of (3). Hence 


—e! et —e~4! 
X(t) = 4e! 203! ew 
e! e! et! 


is a fundamental matrix solution of (3). 
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In this section we will show that the matrix e“’ can be computed di- 
rectly from any fundamental matrix solution of (1). This is rather remark- 
able since it does not appear possible to sum the infinite series [I+ At+ 
(At)*/2!+...] exactly, for an arbitrary matrix A. Specifically, we have the 
following theorem. 


Theorem 14. Let X(t) be a fundamental matrix solution of the differential 
equation x= Ax. Then, 


eA! =X(t)X7'(0). (4) 


In other words, the product of any fundamental matrix solution of (1) with 
its inverse at t=O must yield e™. 


We prove Theorem 14 in three steps. First, we establish a simple test to 
determine whether a matrix-valued function is a fundamental matrix solu- 
tion of (1). Then, we use this test to show that e“’ is a fundamental matrix 
solution of (1). Finally, we establish a connection between any two funda- 
mental matrix solutions of (1). 


Lemma 1. A matrix X(t) is a fundamental matrix solution of (1) if, and only 
if, X(t) = AX(t) and det X(0)+0. (The derivative of a matrix-valued func- 
tion X(t) is the matrix whose components are the derivatives of the corre- 
sponding components of X(t).) 


Proor. Let x!(f),...,x"(#) denote the n columns of X(t). Observe that 


X(1) =(x'(0),...,x"(0)) 
and 
AX(t) =(Ax'(z),...,Ax”(t)). 


Hence, the n vector equations x!(t)= Ax'(Z),...,x"(t)=Ax"(t) are equiv- 
alent to the single matrix equation X(‘)=AX(f). Moreover, n solutions 
x'(r),...,X"(t) of (1) are linearly independent if, and only if, x!(0),...,x”(0) 
are linearly independent vectors of R”. These vectors, in turn, are linearly 
independent if, and only if, det X(0)#0. Consequently, X(¢) is a fundamen- 
tal matrix solution of (1) if, and only if, X(7)=AX(t) and detX(0)40. 


Lemma 2. The matrix-valued function eA‘ =1+ At+A7t?/2!+... is a 
fundamental matrix solution of (1). 


PRooF. We showed in Section 3.10 that (d/dr)e*’=Ae*’. Hence e™ is a 
solution of the matrix differential equation X(t)=AX(t). Moreover, its de- 
terminant, evaluated at r=0, is one since e4°=I. Therefore, by Lemma 1, 
e™' is a fundamental matrix solution of (1). O 


Lemma 3. Let X(t) and Y(t) be two fundamental matrix solutions of (1). 
Then, there exists a constant matrix C such that Y(t)=X(t)C. 
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ProoF. By definition, the columns x'(2),...,x"(t) of X(t) and y'(0),..., 
y"(t) of Y(t) are linearly independent sets of solutions of (1). In particular, 
therefore, each column of Y(t) can be written as a linear combination of 
the columns of X(t); ie., there exist constants c/,...,c/ such that 


y’(t) = cf{x!(t) + c§x?(t) + _ + c/x"(t), J=l,...,n. (5) 


Let C be the matrix (c!,c’,...,c”) where 


Then, the n equations (5) are equivalent to the single matrix equation Y(t) 
= X(Z)C. O 
We are now in a position to prove Theorem 14. 


PROOF OF THEOREM 14. Let X(¢) be a fundamental matrix solution of (1). 
Then, by Lemmas 2 and 3 there exists a constant matrix C such that 


eA! =X(r)C. (6) 
Setting t=0 in (6) gives I= X(0)C, which implies that C=X~'(0). Hence, 
eA! = X(1)X~ !(0). a 
Example 1. Find e“ if 
1 1 1 
A=/|0 3 2). 
0 0 5 


Solution. Our first step is to find 3 linearly independent solutions of the 
differential equation 
1 1 1 
X=!10 3 2)|x. 


0 0 5 


To this end we compute 


1-A l l 
r0)= (AX =de 0 3-27 2 |-0-ne-n-». 
0 0 5-AX 
Thus, A has 3 distinct eigenvalues \=1, \=3, and A=5. 
(i) A=1: Clearly, 
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is an eigenvector of A with eigenvalue one. Hence 


l 
x'(t)=e! q 
0 


is one solution of x= Ax. 
(11) \=3: We seek a nonzero solution of the equation 


Po 3 e}- 
(A-31Iv=| 0 O 2]]%2|=] 0}. 
0 0 2) % 0 


This implies that v,=0 and v,=2v,. Hence, 


is an eigenvector of A with eigenvalue 3. Consequently, 


l 
x?(t) = e* F 
0 


is a second solution of x= Ax. 
(111) A=5: We seek a nonzero solution of the equation 


—4 1 1)/" 0 
(Ast | 0 -2 [=| [9} 
0 0 Oj [% 0 


This implies that v, =v, and v, =v,/2. Hence, 


is an eigenvector of A with eigenvalue 5. Consequently, 


l 
x?(t) =e E 

2 
is a third solution of x=Ax. These solutions are clearly linearly indepen- 
dent. Therefore, 


e! et ef 
X(4)=|0 2e* 2e™ 
0 0 2e* 


is a fundamental matrix solution. Using the methods of Section 3.6, we 
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compute 
_, {1 -3 0 
1 1 177 ? 
x-o)=/0 2 2] =]0 2 -3 
Therefore, 
| 1 i e! e3t est I —3 0 
exp F 3 a =10 2e* 2e%||0 7 73 
0 0 5 0 O. 2e* 0 ; 
e! —tet41e3 —je%+te* 
= 0 es! — e3 +e! 
0 0 oo! 
EXERCISES 
Compute e“’ for A equal 
1 -1 -1 1 1 -1 2 0 1 
1/1 3 4 2(2 3 -4 31 0 1 
-3 1 -1 4 1 -4 1 -2 0 
10 0 0 2 1 ee | 
32 | 1 1 -1 0 -! I 
7. Find A if 
2e7*—e! ee! e'—e7 
eAl = e2t — of 2 e2! ~ e! e!— e! . 


3e7%7-3e! 3e2'-3e' 3e'-—2e! 


In each of Problems 8-11, determine whether the given matrix is a funda- 
mental matrix solution of x= Ax, for some A; if yes, find A. 


e! ef ef +2e! 
8. | ef -e-' e'-2e™'! 
2e' e-' 2e't+e‘) 


—5cos2t —5S5sin2¢ 3e! 
9. | —2(cos2¢+sin2¢) 2(cos2t—sin2t) 0 
cos 2t sin 2¢ e7! 
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1 ¢+1 #41 e Qe! et 
10. e"} 1 24t+1) 47 Wl. |2e! em! 
1 ¢+2 3 3e6 em! 2 


12. Let $/(t) be the solution of the initial-value problem x= Ax, x(0)=e’. Show 
that e4’=($',¢7,...,6”). 


13. Suppose that Y(¢)=X(1)C, where X(¢) and Y(¢) are fundamental matrix solu- 
tions of x = Ax, and C is a constant matrix. Prove that detC+0. 


14, Let X(7) be a fundamental matrix solution of (1), and C a constant matrix with 
det C40. Show that Y(#)=X(a)C is again a fundamental matrix solution of (1). 


15. Let X(t) be a fundamental matrix solution of x=Ax. Prove that the solution 
x(t) of the initial-value problem x= Ax, X(f9)=x° is x(1) = X(0)X7 '(to)x°. 


16. Let X(t) be a fundamental matrix solution of x= Ax. Prove that X(#)X~ (to) = 
eA to). 


17. Here is an elegant proof of the identity e4’t ®’ = e4“e®’ if AB=BA. 

(a) Show that X(t) = e4‘*® satisfies the initial-value problem X =(A+B)X, 
X(0)=I. 

(b) Show that e4‘B=Be“ if AB=BA. (Hint: AYB=BA/ if AB=BA). Then, 
conclude that (d/dt)e“‘e®’ =(A+B)e™eB'. 

(c) It follows immediately from Theorem 4, Section 3.4 that the solution X(¢) 
of the initial-value problem X=(A+B)X, X(0)=I, is unique. Conclude, 
therefore, that e4/t+ B! = eA‘eB', 


3.12 The nonhomogeneous equation; 
variation of parameters 


Consider now the nonhomogeneous equation x = Ax+f(7). In this case, we 
can use our knowledge of the solutions of the homogeneous equation 


x = Ax (1) 

to help us find the solution of the initial-value problem 
X=Ax+f(t), x(t) =x”. (2) 
Let x'(f),...,x"(0) be n linearly independent solutions of the homogeneous 


equation (1). Since the general solution of (1) is c,x'(t)+... +¢,x"(0), it is 
natural to seek a solution of (2) of the form 


x(t) = uy (t)x!(t) + up (t)x2(t) +... + u,(t)x"(2). (3) 


This equation can be written concisely in the form x(t)=X(s)u(t) where 
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X(t) =(x!(t),...,x"()) and 
u(t) 
u(“y=| : 
u,,(t) 
Plugging this expression into the differential equation x =Ax+f(¢) gives 
X(t)u(t) + X(t)u(t) = AX(t)u(t) +£(2). (4) 


The matrix X(t) is a fundamental matrix solution of (1). Hence, X(t)= 
AX(t), and Equation (4) reduces to 


X(é)u(t) = £(7). (5) 


Recall that the columns of X(¢) are linearly independent vectors of R” at 
every time ¢. Hence X~ '(f) exists, and 


a(t) =X '(z)f(2). (6) 
Integrating this expression between /, and ¢ gives 
t 
u(t)=u(to)+ f X~"(s)f(s)ds 
Lo 
=X~'(tg)x°+ [°X7"(s)f(s) ds. 
Lo 
Consequently, 
t 
x(t) =X(1t)X~ "(to)x° + X(2) f X~!(5)f(s) ds. (7) 
Lo 


If X(t) is the fundamental matrix solution e“‘, then Equation (7) sim- 
plifies considerably. To wit, if X(¢)=e4’, then X~'(s)=e~“°. Hence 


t 
x(t) = eA’e~ Alox® + eM f e ASF(s) ds 


lo 


= elroy [eA E(5) ds (8) 
ut) 


Example 1. Solve the initial-value problem 


1 O 0 0 0 
x=/2 1 -—2 0 , x(0)=] 1]. 
e’cos2t ] 


3 2 l 


x+ 
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Solution. We first find e4‘, where 


1 0 0 
A=|2 1 —2|]. 
3 2 l 
To this end compute 
1-A 0 0 
det(A—AI)=det} 2  1-A —2 |=(1—A)(A?—2A+4+5). 
3 2 1-A 
Thus the eigenvalues of A are 
\=1 and \=22 aoe 1+2i. 


(i) \=1: We seek a nonzero vector v such that 


0 0 Of) [0 
(A-Dv=|2 0 —2||e/=Jol. 
3 2 0} | 03 0 


This implies that v, =v, and v,= —3v,/2. Hence 


is an eigenvector of A with eigenvalue 1. Consequently, 


2 
(one 3 
2 


is a solution of the homogeneous equation x = Ax. 
(ii) \=1+2i: We seek a nonzero vector v such that 


—-2i 0 0 |)" 0 
Acai p=| 2 —2i “2]|=-[o} 
3 2 —2i) ( %3 


This implies that v, =0 and v,= — iv,. Hence, 


0 
v= l 
—1 


is an eigenvector of A with eigenvalue 1 +2i. Therefore, 


0 . 
x(t) = 1 ell+2it 


—1 
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is a complex-valued solution of k= Ax. Now, 


0 . 0 0 
1 jet?" = e'(cos2¢+isin2¢)| |1|—i]0 
0 


—j 
0 0 
1|/+sin2¢|0 
0 0 
1|—cos2t| 0] |. 
0 l 


0 
0 0 
x*(t)=e'|cos2r| and x (t)=e'|  sin2r 
sin2t —cos2t 


=e'!lcos2t 


+ ie'| sin2t 


Consequently, 


are real-valued solutions of x= Ax. The solutions x', x”, and x° are linearly 
independent since their values at t=0 are clearly linearly independent vec- 
tors of R*. Therefore, 


2e' 0 0 
X(t)=| —3e' —e' cos2t e‘sin2t 
2e' e'sin2t —e'cos2t 


is a fundamental matrix solution of x =Ax. Computing 


20 o, | {2 9 
X~'(0)=|-3 1 0 =|3 1 
2 90 7} 10 -!1 
we see that 
2e' 0 0 3 0 
eM =! 36! ef cos2t e‘sin2t||3 1 
2e' e'sin2t -—e‘cos2t)|},; 9 —] 
] 0 0 
=e!|— 3+ $cos2t+sin2¢ cos2¢ —sin2r}. 
1+3sin2¢—cos2t sin2¢ — cos2z 
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Consequently, 
0 
x(t)=e""| 1 
] 
' | 0 0 0 
ve fe — 3+ 3cos2s—sin2s cos2s sin2s | 0 \é 
0 1—3sin2s—cos2s —sin2s cos2s| le cos2s 
0 t 0 
=e'| cos2t¢—sin2t ve f sin2scos2s | ds 
cos2¢+sin2¢ 0 cos?25 
0 
me |e” sina +e} (1—cos4r)/8 
cos2¢+sin2¢ t/2+(sin4t)/8 
0 
=e'! cos2¢—sin2t 
cos2¢+sin2¢ 
0 
tsin2t , cos2t—cos4tcos2¢t—sin4tsin2t 
(oe PE EES 
+e 2 8 
tcos2t + sin 4tcos2¢t —sin 2¢cos4¢+ sin2¢ 
8 
0 


—p'| cos2t—(1+4)sin2¢ 


(1 + 41)cos2¢+ 3 sin2¢ 


As Example | indicates, the method of variation of parameters is often 
quite tedious and laborious. One way of avoiding many of these calcula- 
tions is to “guess” a particular solution y(t) of the nonhomogeneous equa- 
tion and then to observe (see Exercise 9) that every solution x(¢) of the 
nonhomogeneous equation must be of the form $(t)+ Y(t) where $(Z) is a 
solution of the homogeneous equation. 


Example 2. Find all solutions of the differential equation 


I O 0 
X=/|2 1 -2 


3. 2 ] 


J 
0 
0 


x+1O0}e%, c#I. (9) 
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Solution. Let 


1 0 0 
2 1 -2). 
3 2 l 


We “guess” a particular solution y(t) of the form y(t)=be. Plugging this 
expression into (9) gives 


1 
che“ = Abe“ + | 0 |e“, 


Or 


This implies that 


Hence, every solution x(t) of (9) is of the form 


2 
x(t)=e'] c,| —3 
2 


0 
cos2t 
sin2t 


+ Cy + C3 


0 
—sin2t 


cos 2t 
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Remark. We run into trouble when c=1 because one is an eigenvalue of 


the matrix 
1 O 0 
2 1 -2'|. 
3 2 | 


More generally, the differential equation x= Ax+ve may not have a 
solution of the form be“ if c is an eigenvalue of A. In this case we have to 
guess a particular solution of the form 


P(th=e"[bot bet... +b,_ f°" "| 


for some appropriate integer k. (See Exercises 10-18). 


EXERCISES 


In each of Problems 1—6 use the method of variation of parameters to 
solve the given initial-value problem. 


1. c=(_3 _5)x+(4e'cos*), x(0)=(9} 
1 att} x=) 


+ ~3)x+(Sint), x@=(9) 


—-1 -l -2 l 0 
6.x=[ 1 l 1|x+10 Je’, x(0)=| 0 
2 1 3 0 0 

7. Consider the nth-order scalar differential equation 


d"y d"~ly 
Lly]= Gin tay te tay =F): (*) 


Let v(t) be the solution of L[y]=0 which satisfies the initial conditions 
y(O)=...=y%-20)=0, y@~ 9(0) = 1. Show that 


y()= i ‘v(t—s) f(s)ds 


is the solution of (*) which satisfies the initial conditions y(0)=... =y@~ 20) = 
0. Hint: Convert (*) to a system of n first-order equations of the form x= Ax, 
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and show that 


v(t) 
v(t) 


v@— Y(t) 
is the nth column of e*. 


8. Find the solution of the initial-value problem 


d¥y dy _ — .,/ yy _ 
ae * de Sect tant, y(0)=y’(0) =y”(0) = 0. 


9. (a) Let ¥(t) be a solution of the nonhomogeneous equation x=Ax+f(t) and 

let @(f) be a solution of the homogeneous equation x = Ax. Show that $(1) 
+ y(t) is a solution of the nonhomogeneous equation. 

(b) Let y,(¢) and y(t) be two solutions of the nonhomogeneous equation. 
Show that y,(4) — y¥(f) is a solution of the homogeneous equation. 

(c) Let y(t) be a particular solution of the nonhomogeneous equation. Show 
that any other solution y(t) must be of the form y(t)=(t)+ y(t) where 
(ft) is a solution of the homogeneous equation. 


In each of Problems 10-14 use the method of judicious guessing to find a 
particular solution of the given differential equation. 


= 2 l ] 3 - — 1 — |] — 2 
10. x (3 _5)e+( Je 11. x (| 3 )x+| P| 
l 3 2 sint 
12. x=] —] 2 1 }xt+ 0 
4 -] -!1 0 
l 2 -3 1 
13. x=] 1 | 2 ixt+ 0 Je’ 
1 -1 4 —] 


-1 -!1 0 l 
14.x=| 0 -4 —-1]x+] 2 
0 5 0 e! 


15. Consider the differential equation 


x = Ax + ve (*) 
where v is an eigenvector of A with eigenvalue A. Suppose moreover, that A has 
n linearly independent eigenvectors v',v’,...,v", with distinct eigenvalues 


A}>-++,A, respectively. 

(a) Show that (*) has no solution y(t) of the form ¥(t)=ae™. Hint: Write a= 
aw'+...+a,v". 

(b) Show that (*) has a solution y(t) of the form 


y(t)=ae"+bre™. 
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Hint: Show that b is an eigenvector of A with eigenvalue A and choose it so 
that we can solve the equation 


(A—-ADa=b-v. 


In each of Problems 16-18, find a particular solution y(t) of the given dif- 
ferential equation of the form y(1)= e (a+b). 


1 1 -1 l 
16.x={[2 3 —4]x+1 2 Je 

4 1 -4 l 

1 —-1 —-! 1 

17. x= 1 3 1 Ix+{ —1 le* 

—3 1 —-1 —|] 

3 2 4 2 
18. x=/2 09 2])x+11 Je* 

4 2 3 2 


3.13 Solving systems by Laplace transforms 


The method of Laplace transforms introduced in Chapter 2 can also be 
used to solve the initial-value problem 


K=Ax+f(t),  — x(0)=x®. (1) 
Let 
x, (s) [rex (oa 
X(s)=| ° =£{x(t)} = 
X,,(s) 0 ast 
i ex (t) dt 
and 
F,(s) fre "hat 
F(s)=| 2 |=£(EG)} = 
F,,(s) east 
f e “f (t)dt 


Taking Laplace transforms of both sides of (1) gives 
R{x(2)} =£{Ax(z) + f(t) } =AL{x(r)} + P{ f(t) } 
= AX(s)+F(s), 
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and from Lemma 3 of Section 2.9, 


{x1 (¢)} sX, (s)— x, (0) 


EiXMJ=) 2 = . 
£(%,()}| | sX,(s)—x,(0) 
= sX(s)—x®. 
Hence, 
sX(s) —x°=AX(s) + F(s) 
or 
1 0 0 
(sl —A)X(s) =x° + F(s), =| | a |. (2) 
00 .. | 


Equation (2) is a system of n simultaneous equations for X,(s),...,X,,(s), 
and it can be solved in a variety of ways. (One way, in particular, is to mul- 
tiply both sides of (2) by (sI—A)~ '.) Once we know X,(s),...,X,,(s) we can 
find x,(t),...,x,(¢) by inverting these Laplace transforms. 


Example 1. Solve the initial-value problem 


(} de(Hes aor) 


Solution. Taking Laplace transforms of both sides of the differential equa- 


tion gives 
_{2\_fl 4 l (1) 
sX(s) (7) (; 1)x()+ s-1\1 
or 
(8 1)X, (s)— 4X3 (s)=2+ + - 
X, (3) +(s-1)X,(s)=14+ 
The solution of these equations is 
2 ] | S 
X,(s)= + ,  X(s)= + ———__________., 
1 (5) s—3 s2?—] 2 (5) 5—3 (s—1)(s+1)(s—3) 
Now, 


y) / l ; tf ,7t 
soy =P (2e"}, and St =e{sinns) =£{ Se}. 
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Hence, 
t_ .-t 
x,(t)=2e% + —— 


To invert X,(s), we use partial fractions. Let 


— 8s CU A, BO 
(s—1)(st+1)(s—3) s-1 s+1 s—3 
This implies that 
A(t Is 3) + Bls~ s— 3) + C(s— sts. (4) 
Setting s=1, —1, and 3 respectively in (4) gives A= —4, B=—3, and 


2. Consequently, 


EXERCISES 


Find the solution of each of the following initial-value problems. 


wae(_4 3x 20-(3 


ma-(5 Z3)s x-(2) 

vse Bp+() 20-8 
cao(l De 2)e x0-(8) 
see} -f)e(IJe. 20-(1 
ce(} p(s) 0-9) 
7. x=(_5 _3)x+ (4e“eose), x0)=(1) 
e(f de(Za} 0-6) 
re( Beet) 20-(0 
10. i=(3 3 )x+ (THO ) x(0)=(°) 
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OoOW © 


3.13 Solving systems by Laplace transforms 


l l 
oa (o} x-( 
3 | 
1 tpe(o}es so-(5) 
1 1 |xt+loOfe’, xO= 
1 3 0 


0 
0 
0 
(eb o-('] 
—2 |x+ 0 , x(O)=1[ 1 
] e'cos2t 1 


wn 
ad 
a 
=) 
~~ 
ll 
ey 
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Qualitative theory of 
differential equations 


4.1 Introduction 


In this chapter we consider the differential equation 


x = f(t,x) (1) 
where 
x, (t) 
x=| : |, 
x,(1) 
and 
Fy (bX yy 000 X,) 
f(t,x)= : 
Fa (Es X p90 00 Xp) 
is a nonlinear function of x,,...,x,. Unfortunately, there are no known 


methods of solving Equation (1). This, of course, is very disappointing. 
However, it is not necessary, in most applications, to find the solutions of 
(1) explicitly. For example, let x,(¢) and x,(¢) denote the populations, at 
time ¢t, of two species competing amongst themselves for the limited food 
and living space in their microcosm. Suppose, moreover, that the rates of 
growth of x,(¢) and x,(7) are governed by the differential equation (1). In 
this case, we are not really interested in the values of x,(t) and x,(?) at ev- 
ery time ¢. Rather, we are interested in the qualitative properties of x,() 
and x,(t). Specically, we wish to answer the following questions. 
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1. Do there exist values €, and §, at which the two species coexist 
together in a steady state? That is to say, are there numbers &,, €, such that 
x (=), x(t) = 1s a solution of (1)? Such values &,,&,, if they exist, are 
called equilibrium points of (1). 

2. Suppose that the two species are coexisting in equilibrium. Suddenly, 
we add a few members of species 1 to the microcosm. Will x,(¢) and x,(?) 
remain close to their equilibrium values for all future time? Or perhaps the 
extra few members give species | a large advantage and it will proceed to 
annihilate species 2. 

3. Suppose that x, and x, have arbitrary values at t=0. What happens 
as t approaches infinity? Will one species ultimately emerge victorious, or 
will the struggle for existence end in a draw? 

More generally, we are interested in determining the following proper- 
ties of solutions of (1). 

1. Do there exist equilibrium values 


for which x(t)=x° is a solution of (1)? 

2. Let @(¢) be a solution of (1). Suppose that (1) is a second solution 
with ¥(0) very close to ¢ (0); that is, y,(0) is very close to $,(0), j=1,...,n. 
Will Y(t) remain close to @(Z) for all future time, or will Y(7) diverge from 
@ (t) as ¢ approaches infinity? This question is often referred to as the prob- 
lem of stability. It is the most fundamental problem in the qualitative the- 
ory of differential equations, and has occupied the attention of many 
mathematicians for the past hundred years. 

3. What happens to solutions x(t) of (1) as ¢ approaches infinity? Do all 
solutions approach equilibrium values? If they don’t approach equilibrium 
values, do they at least approach a periodic solution? 

This chapter is devoted to answering these three questions. Remarkably, 
we can often give satisfactory answers to these questions, even though we 
cannot solve Equation (1) explicitly. Indeed, the first question can be 
answered immediately. Observe that x(f) is identically zero if x(t)=x°. 
Hence, x° is an equilibrium value of (1), if, and only if, 


f(t,x°)=0. (2) 
Example 1. Find all equilibrium values of the system of differential equa- 


tions 


dx 2. 3 
=X, +X. 


dee 
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0 
xo=| 
x} 
is an equilibrium value if, and only if, 1—x9=0 and (x?) +x2=0. This 


implies that x2=1 and x?= —1. Hence ( —- | 


Solution. 


is the only equilibrium value 


of this system. 


Example 2. Find all equilibrium solutions of the system 


Ha(e-yy-),  Ba(xt (+0). 


XQ 
ye) 
is an equilibrium value of this system if, and only if, (x)— 1)(yp— 1) =0 and 
(Xo + 1)(¥9 + 1) =0. The first equation is satisfied if either x9 or yp is 1, 


while the second equation is satisfied if either x9 or yo is —1. Hence, x=], 
y=-—1 and x=~—], y=1 are the equilibrium solutions of this system. 


Solution. 


The question of stability is of paramount importance in all physical 
applications, since we can never measure initial conditions exactly. For 
example, consider the case of a particle of mass one kgm attached to an 
elastic spring of force constant 1 N/m which is moving in a frictionless 
medium. In addition, an external force F(t)=cos2t N is acting on the 
particle. Let y(t) denote the position of the particle relative to its equi- 
librium position. Then (d7y/dt*)+ y =cos2t. We convert this second-order 
equation into a system of two first-order equations by setting x,= y, 


x, = y’. Then, 
dx dx, 
Wt G7 Txt cos2t. (3) 


The functions y,(t)=sin¢ and y,(t)=cost are two independent solutions of 
the homogeneous equation y” +y =0. Moreover, y= — }cos2t is a particu- 
lar solution of the nonhomogeneous equation. Therefore, every solution 


_ x(t) 
m= | | 


of (3) 1s of the form 


x()=c,( SI!) + c4/ cost) 4 | — ;cos2t (4) 


cost —sint 
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At time t=0 we measure the position and velocity of the particle and ob- 
tain y(0)= 1, y’(0)=0. This implies that c, =0 and c,= 4. Consequently, the 
position and velocity of the particle for all future time are given by the 


equation 
(7° )-(203)-| erajene | (5) 


y(t) x(t) — 4sint+ 2sin2t 


However, suppose that our measurements permit an error of magnitude 
10~*. Will the position and velocity of the particle remain close to the val- 
ues predicted by (5)? The answer to this question had better be yes, for 
otherwise, Newtonian mechanics would be of no practical value to us. For- 
tunately, it is quite easy to show, in this case, that the position and velocity 
of the particle remain very close to the values predicted by (5). Let y(t) and 
y(t) denote the true values of y(¢) and y’(t) respectively. Clearly, 


y(t)—¥ (t)=(F —cy)cost—c,sint 
y'(t)—¥'(t)= — ¢, cost —(¢— c,)sint 
where c, and c, are two constants satisfying 
—10-*<c,<1074, = - 104K c, < $ +1074. 


We can rewrite these equations in the form 


. 2,!1/2 Cc 
y()-I(=|ei+($-e) | cos(r—8,),  tandy=—; 
3 


ya-y(oO= lei +(3 — c2) | M” cost — 5, ), tand,= 39 


Hence, both y(t)—y(t) and y’(t)— y(t) are bounded in absolute value by 
[c? +($—c,)"]'*. This quantity is at most V2 10~*. Therefore, the true 
values of y(#) and y’(t) are indeed close to the values predicted by (5). 

As a second example of the concept of stability, consider the case of a 
particle of mass m which is supported by a wire, or inelastic string, of 
length / and of negligible mass. The wire is always straight, and the system 
is free to vibrate in a vertical plane. This configuration is usually referred 
to as a simple pendulum. The equation of motion of the pendulum is 


where y is the angle which the wire makes with the vertical line AO (see 
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Figure 1 


Figure 1). Setting x,=y and x, = dy /dt we see that 


aie 77 Ssinx. (6) 
The system of equations (6) has equilibrium solutions x, =0, x,=0, and x, 
=, x,=0. (If the pendulum is suspended in the upright position y=7 
with zero velocity, then it will remain in this upright position for all future 
time.) These two equilibrium solutions have very different properties. If we 
disturb the pendulum slightly from the equilibrium position x,=0, x,=0, 
by either displacing it slightly, or giving it a small velocity, then it will ex- 
ecute small oscillations about x, =0. On the other hand, if we disturb the 
pendulum slightly from the equilibrium position x,=7, x,=0, then it will 
either execute very large oscillations about x,=0, or it will rotate around 
and around ad infinitum. Thus, the slightest disturbance causes the 
pendulum to deviate drastically from its equilibrium position x, = 7, x,=0. 
Intuitively, we would say that the equilibrium value x, =0, x,=0 of (6) is 
stable, while the equilibrium value x,=7, x,=0 of (6) is unstable. This 
concept will be made precise in Section 4.2. 

The question of stability is usually very difficult to resolve, because we 
cannot solve (1) explicitly. The only case which is manageable is when 
f(¢,x) does not depend explicitly on ¢; that is, f is a function of x alone. 
Such differential equations are called autonomous. And even for autono- 
mous differential equations, there are only two instances, generally, where 
we can completely resolve the stability question. The first case is when f(x) 
= Ax and it will be treated in the next section. The second case is when we 
are only interested in the stability of an equilibrium solution of x=f(x). 
This case will be treated in Section 4.3. 

Question 3 is extremely important in many applications since an answer 
to this question is a prediction concerning the long time evolution of the 
system under consideration. We answer this question, when possible, in 
Sections 4.6—4.8 and apply our results to some extremely important ap- 
plications in Sections 4.9-4.12. 
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EXERCISES 


In each of Problems 1-8, find all equilibrium values of the given system of 
differential equations. 


1. 


10. 


11. 


& = y— x? 2xy 2. o = — Bry +p 
@ = 2y—2y?-3xy © = By —y 
. & = ax— bry 4, Bs — x xy? 
<= — cy + dry O = -y-yx? 
@ aztxtty? @ at-2tx? 
H = xy? x 6 & = cosy 
ds, dy 
a sy GH 7 sinx! 
= —|-y-e 8. Zax? 
ax? +y(e*=1) O ax?) 
© a xtsinz de etx 
. Consider the system of differential equations 
T= axt by, Oe mex dy (*) 


(i) Show that x =0, y=0 is the only equilibrium point of (*) if ad— bc 40. 
(11) Show that (*) has a line of equilibrium points if ad— bc =0. 


Let x= x(t), y=y(t) be the solution of the initial-value problem 
=—x- y =2x— 0)=y(0)=1 
at X—Y; dt X—~Y; x( y=y( )=1. 


Suppose that we make an error of magnitude 10~* in measuring x(0) and y(0). 
What is the largest error we make in evaluating x(t), y(t) for 0< t< 00? 


(a) Verify that 


is the solution of the initial-value problem 


11 1 2 1 
:-( 2 1 -i)-| Je «o)=(0) 
-3 2 4 ~3 0 
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(b) Let x=y(t) be the solution of the above differential equation which satis- 
fies the initial condition 


] 
xo=x4( 0} 
0 


Show that each component of y(t) approaches infinity, in absolute value, 
as [> ©. 


4.2 Stability of linear systems 


In this section we consider the stability question for solutions of autono- 
mous differential equations. Let x=@(¢) be a solution of the differential 
equation 


X= f(x). (1) 
We are interested in determining whether ¢(f) is stable or unstable. That is 
to say, we seek to determine whether every solution y(t) of (1) which starts 
sufficiently close to $(t) at t=O must remain close to (f) for all future 
time ¢ 20. We begin with the following formal definition of stability. 


Definition. The solution x = ¢(f) of (1) is stable if every solution (7) of (1) 
which starts sufficiently close to (tf) at +=0 must remain close to $(f) 
for all future time ¢. The solution (ft) is unstable if there exists at least 
one solution y(¢) of (1) which starts near @(¢) at t=0 but which does 
not remain close to $(Z) for all future time. More precisely, the solution 
(t) is stable if for every e>0 there exists 6=6(e) such that 


W()-e(Dl<e if |¥(0)—4,0)|<d(e), f=1,...,0 
for every solution y(¢) of (1). 


The stability question can be completely resolved for each solution of 
the linear differential equation 


x= Ax. (2) 


This is not surprising, of course, since we can solve Equation (2) exactly. 
We have the following important theorem. 


Theorem 1. (a) Every solution x=(t) of (2) is stable if all the eigenvalues 
of A have negative real part. 
(b) Every solution x = (t) of (2) is unstable if at least one eigenvalue of A 
has positive real part. 
(c) Suppose that all the eigenvalues of A have real part <0 and \,= 
i0),...,A,= 0, have zero real part. Let \,=io, have multiplicity k,. This 
means that the characteristic polynomial of A can be factored into the form 


P(A)=(A- io,)""...(A—ioy)"Q(A) 
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where all the roots of q(A) have negative real part. Then, every solution 
x=(t) of (1) is stable if A has k, linearly independent eigenvectors for 
each eigenvalue h, = io,. Otherwise, every solution $(t) is unstable. 


Our first step in proving Theorem | is to show that every solution $(f) is 
stable if the equilibrium solution x(¢)=0 is stable, and every solution $(f) 
is unstable if x(¢)=0 is unstable. To this end, let Y(t) be any solution of 
(2). Observe that z(t)=(t)— (2) is again a solution of (2). Therefore, if 
the equilibrium solution x(t)=0 is stable, then z(t)=(t)— Y(t) will always 
remain small if z(0)=@(0)— ¥(0) is sufficiently small. Consequently, every 
solution (ft) of (2) is stable. On the other hand suppose that x(¢)=0 is un- 
stable. Then, there exists a solution x=h(¢) which is very small initially, 
but which becomes large as ¢ approaches infinity. The function y(t)=¢(f¢) 
+h(¢) is clearly a solution of (2). Moreover, y(t) is close to $(f) initially, 
but diverges from ¢$(f) as ¢ increases. Therefore, every solution x=¢(f) of 
(2) is unstable. 

Our next step in proving Theorem | is to reduce the problem of showing 
that n quantities y,(t), 7=1,...,2 are small to the much simpler problem of 

‘showing that only one quantity is small. This is accomplished by introduc- 
ing the concept of length, or magnitude, of a vector. 


Definition. Let 


x} 
X=| ; 
Xn 
be a vector with n components. The numbers x,,...,x, may be real or 


complex. We define the length of x, denoted by ||x|| as 


|x|] = max {|x,|,|2|,...5 [x] }. 
For example, if 
l 
X=/ 2], 
—3 
then ||x||=3 and if 
1+2i 
x= 2 
—] 


then ||x||/=V5. 


The concept of the length, or magnitude of a vector corresponds to the 
concept of the length, or magnitude of a number. Observe that ||x|| > 0 for 
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any vector x and ||x||=0 only if x=0. Second, observe that 
|Ax|| =max{|Axj|,....[Ax,1} =[Almax{|xy],---[%n]} = IAI 
Finally, observe that 
[Ix+y|] =max{|x, +y\|,-.-5]%, tal} <max{[xy]+[yil,--- [Xl +1 yal} 
< max{ |x], +5 a1} tmax{| yf +5 Yul} = IPXll + llyll- 


Thus, our definition really captures the meaning of length. 
In Section 4.7 we give a simple geometric proof of Theorem | for the 
case n=2. The following proof is valid for arbitrary n. 


PROOF OF THEOREM |. (a) Every solution x=y(t) of x= Ax is of the form 
v(t) = e™Y (0). Let ¢,(t) be the ij element of the matrix e“‘, and let 
y,...,~° be the components of ¥(0). Then, the ith component of y(?) is 


(1) =O (OW +... +O,(2WE=  ailous 


Suppose that all the eigenvalues of A have negative real part. Let — a, be 
the largest of the real parts of the eigenvalues of A. It is a simple matter to 
show (see Exercise 17) that for every number — a, with —a,< —a<0, we 
can find a number K such that |¢,,(2)| < Ke”, ¢ > 0. Consequently, 


Wi(2)|< 2D) Ke~™|y?|= Keo > |yy 
j=l j=1 
for some positive constants K and a. Now, Iw; < ||~(O)||. Hence, 
ly (2)l|=max{|y, (2)... 1¥,(O|} < 2Ke~™||h(0) |]. 
Let e>0 be given. Choose 6(¢) =e /nK. Then, ||P(t)Il <e if || Y(O)I| < 6(e) 
and ¢t=0, since 
Ip (t)|| < nKe~ || p(0)|| <nKe/nK =e. 


Consequently, the equilibrium solution x(t)=0 is stable. 

(b) Let A be an eigenvalue of A with positive real part and let v be an 
eigenvector of A with eigenvalue A. Then, ¥(t)=cev is a solution of x= 
Ax for any constant c. If d is real then v is also real and ||y¥(t)|| =|c|e*||v||. 
Clearly, ||¥(2)|| approaches infinity as ¢ approaches infinity, for any choice 
of c#0, no matter how small. Therefore, x(¢)=0 is unstable. If A\=a+iB 
is complex, then y=v! + iv’ is also complex. In this case 


efe+ iB) (yl + jy?) = e® (cos Bt + isin Bt)(v! + iv’) 
= ew (v' cos Bt —v’ sin Bt) + i(v! sin Bt + v? cos Bt) | 
is a complex-valued solution of (2). Therefore 
y'(t)=ce™ (v' cos Bt —v’ sin Br) 
is a real-valued solution of (2), for any choice of constant c. Clearly, 
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\|¥'(¢)|| is unbounded as ¢ approaches infinity if c and either v' or v’ is 
nonzero. Thus, x(t)=0 is unstable. 

(c) If A has k; linearly independent eigenvectors for each eigenvalue A; = 
io, of multiplicity k,, then we can find a constant K such that ce’) y|< K 
(see Exercise 18). There, ||W(2)|| < 2K ||~(0)|| for every solution (2) of (2). 
It now follows immediately from the proof of (a) that x(7)=0 is stable. 

On the other hand, if A has fewer than k, linearly independent eigenvec- 
tors with eigenvalue A, =io,, then x= Ax has solutions y(7) of the form 


y(t)= ce'%"| y+ 1(A—iol)v| 


where (A —io,I)v+0. If o,=0, then ¥(t)=c(v+ ¢Av) is real-valued. More- 
over, ||yY(7)|| 1s unbounded as ¢ approaches infinity for any choice of c¥0. 
Similarly, both the real and imaginary parts of p(t) are unbounded in mag- 
nitude for arbitrarily small ¥(0)#0, if 6, #0. Therefore, the equilibrium 
solution x(¢)=0 is unstable. O 


If all the eigenvalues of A have negative real part, then every solution 
x(t) of x=Ax approaches zero as ¢ approaches infinity. This follows im- 
mediately from the estimate ||x(z)|| < Ke °'||x(0)|| which we derived in the 
proof of part (a) of Theorem 1. Thus, not only is the equilibrium solution 
x(t)=0 stable, but every solution Y(t) of (2) approaches it as ¢ approaches 
infinity. This very strong type of stability is known as asymptotic stability. 


Definition. A solution x =¢(t) of (1) is asymptotically stable if it is stable, 
and if every solution (7) which starts sufficiently close to @(f) must ap- 
proach #(¢) as ¢ approaches infinity. In particular, an equilibrium solu- 
tion x(t)=x° of (1) is asymptotically stable if every solution x= (7) of 
(1) which starts sufficiently close to x° at time t=O not only remains 
close to x° for all future time, but ultimately approaches x° as ¢ ap- 
proaches infinity. 


Remark. The asymptotic stability of any solution x =(f) of (2) is clearly 
equivalent to the asymptotic stability of the equilibrium solution x(#)=0. 


Example 1. Determine whether each solution x(t) of the differential equa- 
tion 


is stable, asymptotically stable, or unstable. 
Solution. The characteristic polynomial of the matrix 


—] ) 0 
—-2 —-] 2 


—-3 -2 -1 
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—1-A 0 0 
p(A)=(detA—AI)=det; —-2 —1-A 2 
—3 —2 —1-A 


= —(1+A)?—4(1+A)= —(1+A)(A2+2A +5). 


Hence, A= — 1 and A= —1+2i are the eigenvalues of A. Since all three ei- 
genvalues have negative real part, we conclude that every solution of the 
differential equation x = Ax is asymptotically stable. 


Example 2. Prove that every solution of the differential equation 


._{1 5 

. (5 1s 
is unstable. 
Solution. The characteristic polynomial of the matrix 

(i 5) 

a=(5 

1S 
_ yh 1-A 5 \_ yy y)2_ 
p(A) = det(A — AI) = det 5 1d =(1—A)*—25. 


Hence \=6 and A= —4 are the eigenvalues of A. Since one eigenvalue of 
A is positive, we conclude that every solution x = @(¢) of x = Ax is unstable. 


Example 3. Show that every solution of the differential equation 
-_(0 —3 
. (5 0) 

is stable, but not asymptotically stable. 

Solution. The characteristic polynomial of the matrix 


_(0 -3 
a=($ 5) 
1S 
p(A)=det(A~Al) = det( ~ ~3)=r2+6. 


Thus, the eigenvalues of A are \= + V6 i. Therefore, by part (c) of Theo- 
rem 1, every solution x=@(t) of k= Ax is stable. However, no solution is 
asymptotically stable. This follows immediately from the fact that the gen- 
eral solution of x= Ax is 


x)= 4 —V6 sin VEU + (v6 cos V6 1) 
2cos V6 t 2sin V6 ¢ 
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Hence, every solution x(t) is periodic, with period 27/V6 , and no solu- 
tion x(t) (except x(t)=0) approaches 0 as ¢ approaches infinity. 


Example 4. Show that every solution of the differential equation 


is unstable. 
Solution. The characteristic polynomial of the matrix 


2 -3 0 
A=| 0 -6 2 
—6 0 -3 
1S 
2—X —3 0 
ro) dead 0 -6-A —2 |-- +n, 
—6 0 —-3-A 


Hence, the eigenvalues of A are A\= —7 and A=0. Every eigenvector v of A 
with eigenvalue 0 must satisfy the equation 

0 

0}. 

0 


2 —3 0); % 
0 -6 —-2{/%2)= 
—6 0 -—3)[% 


This implies that v, =3v,/2 and v,= —3v,, so that every eigenvector v of 
A with eigenvalue 0 must be of the form 


Consequently, every solution x =¢(t) of x= Ax is unstable, since A\=0 is an 
eigenvalue of multiplicity two and A has only one linearly independent e1- 
genvector with eigenvalue 0. 


EXERCISES 


Determine the stability or instability of all solutions of the following sys- 
tems of differential equations. 


- l l . ({(-3 —-4 
1. x=(_) _})x 2. i=( , t)x 
_{-5 3 _{1 —-4 
3. x (7 x 4. x (} “4)x 
—7 1 -6 3 2 4 
5.x=| 10 —4 12 |x 6 x=|2 0 2 |x 
2 -1 l 4 2 3 
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0 2 1 —2 l | 

7.x=[-1 -—-3 —-11{x 8. x=| -—3 2 31x 
l 1 —-Il 1 -l -2 
0 2 0 0 0 2 1 O 

._|—-2 0 0 0 ._|—-2 0 0 1 

9, x= 0 0 02 x 10. x 0 0 0 2 x 

0 0 -2 O 0 0 -2 O 
11. Determine whether the solutions x(#)=0 and x(t)=1 of the single scalar equa- 


tion x = x(1—<x) are stable or unstable. 


12. Determine whether the solutions x(t)=0 and x(t)=1 of the single scalar equa- 
tion x = — x(1— x) are stable or unstable. 
13. Consider the differential equation x = x*. Show that all solutions x(t) with x(0) 


14. 


15. 


16. 


17. 


> 0 are unstable while all solutions x(t) with x(0) <0 are asymptotically stable. 


Consider the system of differential equations 
X= X2 
X2= —$[ abt (x34 4x3)" |x. (*) 
(a) Show that 
=(20 -(gner? 
X7(t) c*cos(ct+ d) 


is a solution of (*) for any choice of constants c and d. 
(b) Assume that a solution x(f) of (*) is uniquely determined once x,(0) and 
x,(0) are prescribed. Prove that (a) represents the general solution of (*). 
(c) Show that the solution x=0 of (*) is stable, but not asymptotically stable. 
(d) Show that every solution x(t)=0 of (*) is unstable. 


Show that the stability of any solution x(t) of the nonhomogeneous equation 
x= Ax+f(r) is equivalent to the stability of the equilibrium solution x=0 of 
the homogeneous equation x = Ax. 


Determine the stability or instability of all solutions x(t) of the differential 


equation 
._{-l -1l l 
x= (7) Ty )e+(5} 


(a) Let f(t)=¢%e~", for some positive constants a and b, and let c be a posi- 
tive number smaller than b. Show that we can find a positive constant K 
such that | f(t)| < Ke~“, 0< t< oo. Hint: Show that f(t)/e “ approaches 
zero as ¢ approaches infinity. 

(b) Suppose that all the eigenvalues of A have negative real part. Show that we 
can find positive constants K and a such that |(e“’),| < Ke~™ for 1<i, 
j<n. Hint: Each component of e“ is a finite linear combination of func- 
tions of the form g(f)e*, where q(f) is a polynomial in ¢ (of degree < n—1) 
and A is an eigenvalue of A. 
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18. (a) Let x(t)=e'“v, o real, be a complex-valued solution of x= Ax. Show that 
both the real and imaginary parts of x(t) are bounded solutions of x= Ax. 
(b) Suppose that all the eigenvalues of A have real part <0 and A, =io),...,A, 
= io, have zero real part. Let A;=i0, have multiplicity k,, and suppose that 
A has k; linearly independent eigenvectors for each eigenvalue A,, j= 1,...,/. 
Prove that we can find a constant K such that |(e4‘),| < K. 


19. Let 


x 


and define ||x||,;=|x,|+... +|x,|. Show that 
(1) ||x||,; =O and ||x||,;=0 only if x=0 


(ii) |JAX|], =[ATIIXIh1 
(ut) |[x+y|]1< [xl] +Mlylhi 
20. Let 
x| 
x={ : 
x 


and define ||x||,=[|x,|?+ ... +|x,|7]!/2. Show that 
(i) ||x||2>0 and ||x||,=0 only if x=0 

(ii) ||Ax|]2=|AIIIx|l2 

(iii) ||x+y|l2 < [|xll2 + llyll2 


21. Show that there exist constants M and N such that 
M ||x||1 < ||X|l2 < N [[x||1.- 


4.3 Stability of equilibrium solutions 


In Section 4.2 we treated the simple equation x=Ax. The next simplest 
equation is 


x= Ax + g(x) (1) 

where 

£1(x) 

g(x)=| : 

8,,(X) 

is very small compared to x. Specifically we assume that 
8, (x) &,,(x) 
max{|x,|,...,|x,|} max{|x,|,...,|x,]} 
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are continuous functions of x,,...,x, Which vanish for x,=...= x, =0. 
This is always the case if each component of g(x) is a polynomial in 
X}>+..,X, Which begins with terms of order 2 or higher. For example, if 


then both x,x}/max{|x,|, |x|} and x,x,/max{|x,|,|x,|} are continuous 
functions of x,,x, which vanish for x, = x,=0. 

If g(0)=0 then x(t)=0 is an equilibrium solution of (1). We would like 
to determine whether it is stable or unstable. At first glance this would 
seem impossible to do, since we cannot solve Equation (1) explicitly. How- 
ever, if x is very small, then g(x) is very small compared to Ax. Therefore, 
it seems plausible that the stability of the equilibrium solution x(t)=0 of 
(1) should be determined by the stability of the “approximate” equation 
x = Ax. This is almost the case as the following theorem indicates. 


Theorem 2. Suppose that the vector-valued function 


g(x)/||x||=e(x)/max{|x)],...,],1} 
is a continuous function of x,,...,X, which vanishes for x=0. Then, 
(a) The equilibrium solution x(t)=0 of (1) is asymptotically stable if the 
equilibrium solution x(t)=0 of the “linearized” equation x = Ax is asymp- 
totically stable. Equivalently, the solution x(t)=0 of (1) is asymptotically 
stable if all the eigenvalues of A have negative real part. 
(b) The equilibrium solution x(t)=0 of (1) is unstable if at least one eigen- 
value of A has positive real part. 
(c) The stability of the equilibrium solution x(t)=0 of (1) cannot be de- 
termined from the stability of the equilibrium solution x(t)=0 of x= Ax if 
all the eigenvalues of A have real part <0 but at least one eigenvalue of A 
has zero real part. 


Proor. (a) The key step in many stability proofs is to use the variation of 
parameters formula of Section 3.12. This formula implies that any solution 
x(t) of (1) can be written in the form 


x(t) = e4"x(0) + [ ‘eMC Dg(x(s)) ds. (2) 


We wish to show that ||x(t)|| approaches zero as t approaches infinity. To 
this end recall that if all the eigenvalues of A have negative real part, then 
we can find positive constants K and a such that (see Exercise 17, Section 
4,2). 


l|e“’x(0)|| < Ke~°*||x(0)| 
and 


| eAl—5)g(x(s)) \| < Ke ~a—S)iig(x(s))|l- 
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Moreover, we can find a positive constant o such that 
sx <Zelxll i [Ix <o 


This follows immediately from our assumption that g(x)/||x|| is continuous 
and vanishes at x=0. Consequently, Equation (2) implies that 


x) < eMxO+ f eA Dg(x(s)) || ds 


t 
Ke“ |jx(O)l)+ 5 fee s)Ih 4 


as long as ||x(s)|| <0, 0< s<t. Multiplying both sides of this inequality by 
e*' gives 


t 
e*|x(2)II < KIO) + F [ e%*|Ix(s)II ds. (3) 
0 
The inequality (3) can be simplified by setting z(t) =e ||x(2)||, for then 


2(1)< Kix +5 f '2(s) ds. (4) 


We would like to differentiate both sides of (4) with respect to t. However, 
we cannot, in general, differentiate both sides of an inequality and still pre- 
serve the sense of the inequality. We circumvent this difficulty by the 
clever trick of setting 


U(t)= 5 f 2(s) 4s 
Then 
dU(t) _ 
dt 


72(t)< 5 K\|x(O)|l + 5 5 U(t) 


or 
dU (t) 
GGUS F KOI. 
Multiplying both sides of this inequality by the integrating factor e 
gives 


—at/2 


ad e 7 at/2 <t —at/2 

6 8U < S& |ix(O)|jen ?, 
or 

© e-#L U(1)+ K\jx(0)||] <0 
Consequently, 


e~*/2] U(t) + K||x(0)|| ] < U(0) + K||x(0)|| = K|Ix(0) IL, 
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so that U(t)< — K||x(0)|| + K||x(0)|je/. Returning to the inequality (4), 
we see that 


Ix(2)|| = e7 Z(t) < e™| K||x(O)||+ U(D) 
< K||x(0) Je"? (5) 


as long as ||x(s)|| <0, 0<s<t. Now, if ||x(0)||<o/K, then the inequality 
(5) guarantees that ||x(z)|| < o “tor all future time ¢. Consequently, the in- 
equality (5) is true for all +20 if ||x(Q)||<o/K. Finally, observe from (5) 
that ||x(z)|| < K||x(0)|| and ||x(2)|| approaches zero as ¢ approaches infinity. 
Therefore, the equilibrium solution x(t)=0 of (1) is asymptotically stable. 
(b) The proof of (b) is too difficult to present here. 

(c) We will present two differential equations of the form (1) where the 
nonlinear term g(x) determines the stability of the equilibrium solution x(¢) 
=0. Consider first the system of differential equations 

dx dx 
= = X_— xX, (xf + x3), 7 = — X)— X2(x7+ x3). (6) 


The linearized equation is 
ai(xs)-(9 o)lss) 
dt —] O/\%2/ 
and the eigenvalues of the matrix 
(_t 9) 
—-1 0 


are +i. To analyze the behavior of the nonlinear system (6) we multiply 
the first equation by x,, the second equation by x, and add; this gives 


dx, dx, 
1 teG =H T(x} + x3) — xZ(xT + x9) 
=—(xji+ x2)" 
But 
x) “1 +X, “2 = 5 £ (x}+ x3) 
Hence, 


Ht 4 (x2 + x3) = = —2(x} + x3)’. 


This implies that 


c 
x}(t)+x3()= 5 —, 
where 
c= x} (0) + x} (0). 
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Thus, x?(¢)+ x(t) approaches zero as ¢ approaches infinity for any solu- 
tion x,(t), x2(t) of (6). Moreover, the value of x?+ x} at any time f is al- 
ways less than its value at t=0. We conclude, therefore, that x,(t)=0, x,(t) 
=0 is asymptotically stable. 

On the other hand, consider the system of equations 


dx, dx, 
Got Xa t mi (xt + x2); Go ala + 3); (7) 
Here too, the linearized system is 
.-_{ O | 
x ( _1 o}* 
In this case, though, (d/dt)(x? + x3) =2(x?+ x3)*. This implies that 
C 
xi(t)+393()=7—5zU> = x1 (0) + x20). 


Notice that every solution x,(t), x,(f) of (7) with x?(0)+ x3(0)40 ap- 
proaches infinity in finite time. We conclude, therefore, that the equi- 
librium solution x,(t)=0, x,(#)=0 is unstable. 0 


Example 1. Consider the system of differential equations 


dx, ; 
—— = — 2x, +x,4+3x34+9x5 
dt 
dx, ; 
“dt. = — 6x, —5x34+7x3 
dx, 

= 2 
Tr —XxX3+ x7 + x3. 


Determine, if possible, whether the equilibrium solution x,(t)=0, x,()=0, 
x,(t)=0 is stable or unstable. 
Solution. We rewrite this system in the form x= Ax +(x) where 


3 
x) —2 l 3 9x2 
x=|%)],  A=/| 0 -6 -—5| and g(x)=| 7x3 
x _ 
3 0 0 l x2+ x? 


The function g(x) satisfies the hypotheses of Theorem 2, and the eigenval- 
ues of A are —2, —6 and —1. Hence, the equilibrium solution x(t)=0 is 
asymptotically stable. 


Theorem 2 can also be used to determine the stability of equilibrium 
solutions of arbitrary autonomous differential equations. Let x° be an 
equilibrium value of the differential equation 


x = f(x) (8) 
389 


4 Qualitative theory of differential equations 


and set z(f)=x(t)—x°. Then 
z=x=f(x’+z). (9) 
Clearly, z(t)=0 is an equilibrium solution of (9) and the stability of x()= 
x° is equivalent to the stability of z(t)=0. 
Next, we show that f(x°+z) can be written in the form f(x°+z)=Az+ 
g(z) where g(z) is small compared to z. 


Lemma 1. Let f(x) have two continuous partial derivatives with respect to 

each of its variables x,,...,x,. Then, {(x°+z) can be written in the form 

f(x° +z) =f(x°) + Az+g(z) (10) 

where g(z)/max{|z,|,...,|Z,|} is @ continuous function of z which vanishes 
for z=0. 


Proor #1. Equation (10) is an immediate consequence of Taylor’s Theo- 
rem which states that each component §(x° +z) of f(x°+z) can be written 
in the form 


f(x +z) =f (x°)+ ue 224 t + g,(z) 


where g,(z)/max{|Z,|,...,|Z,|} is a continuous function of z which vanishes 


for z=0. Hence, 
f(x° +z) =f(x°) +Az+g(z) 


where 
of; (x°) af, (x°) 
0x, 0X, 
Of, (x°) of, (x°) 
Ox, OX,, 


PROOF #2. If each component of f(x) is a polynomial (possibly infinite) in 


Xj9+++)X,, then each component of f(x°+z) is a polynomial in z,,...,Z,,. 
Thus, 
G(x +z) =ay+a)2, +... +4,2,+8,(Z) (11) 


where g((z) is a polynomial in z,,...,z, beginning with terms of order two. 
Setting z=0 in (11) gives f(x°) = ap. ” Hence, 


Qi oee Qin 
f(x°+z)=f(x°)+Az+g(z), A= 
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and each component of g(z) is a polynomial in Z,,...,z, beginning with 
terms of order two. oO 


Theorem 2 and Lemma | provide us with the following algorithm for 
determining whether an equilibrium solution x(t)=x° of x=f(x) is stable 
or unstable: 


1. Set z=x—-x®. 

2. Write f(x°+z) in the form Az+g(z) where g(z) is a vector-valued poly- 
nomial in Z,,...,Z, beginning with terms of order two or more. 

3. Compute the eigenvalues of A. If all the eigenvalues of A have negative 
real part, then x(t)=x° is asymptotically stable. If one eigenvalue of A 
has positive real part, then x(#)=x° is unstable. 


Example 2. Find all equilibrium solutions of the system of differential 
equations 


and determine (if possible) whether they are stable or unstable. 
Solution. The equations 1— xy =0 and x—y°=0 imply that x=1, y=1 or 
x=-—1,y=-1. Hence, x()=1, y(O=1, and x(4)= — 1, y\()= — 1 are the 
only equilibrium solutions of (12). 

(i) x()=1, yO=1: Set uw=x—1, v=y—1. Then, 


du _ ax __ 
a da —(1+u)(1+0) u—v—Uv 
do? a1 +u)- (1+ 0) =u—3v0—-30?- 0’. 


We rewrite this system in the form 
du (~] “3 e)-( uv 
alo)“( —3)(o)-( 32409} 
(oT 23) 
1 —3 


has a single eigenvalue A= —2 since 


The matrix 


-1-A —] 
det ; 51 JRCFNG+A)+1=A+2/. 
Hence, the equilibrium solution x(t)=1, y()=1 of (12) is asymptotically 
stable. 

(i) x()= —1, y\O= — 1: Set u=x+1, v=y+1. Then, 


eed —(u-1)(o—-1l)=u+v0—wuw 
d 
@ = = (u- 1)—(v—1)P =u—304+30?-v?. 
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We rewrite this system in the form 


ae)"(1 —3)()* (202-02) 
The eigenvalues of the matrix 
(; 3) 
1 -3 
are A, = —1— V5, which is negative, and A, = —1+ V5 , which is positive. 


Therefore, the equilibrium solution x(t)= — 1, y()= — 1 of (12) 1s unsta- 
ble. 


Example 3. Find all equilibrium solutions of the system of differential 
equations 


ax —. Dd _ x 
i =sin(x+y), a? l (13) 


and determine whether they are stable or unstable. 

Solution. The equilibrium points of (13) are determined by the two equa- 
tions sin(x + y)=0 and e*—1=0. The second equation implies that x =0, 
while the first equation implies that x + y=nz7, n an integer. Consequently, 
x(H=0, y(N=nz7, n=0,+1, +2,..., are the equilibrium solutions of (13). 
Setting u=x, v=y—nm7, gives 


du _ do _ ju_ 
ai =sin(u+o+tnmz), ae I. 
Now, sin(u + vo + n7)=cosnzsin(u + v)=(— 1)"sin(u+ v). Therefore, 
du —(.1\"¢: dv _ “u__ 
a 1) sin(u+ v), ae l. 
Next, observe that 
3 
ut vo 2 
sin(utv)=uto—< 31 +..., e“—l=ut 57+ 
Hence, 
| 3 
du _ n (u+v) dv _ u? 
nt 1) | (u+v) 3 Ct | an ttat 


We rewrite this system in the form 


g(u-(Co (1) \(x)+ terms of order 2 or 
é l 0 higher in u and v. 


The eigenvalues of the matrix 


((-" co") 
l 0 
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are 


—_ (-1)"-yi+4(-1)" _ (-1)"+y1+4(-1)" 


2 yO 2 


When n is even, \, =(1— V5 )/2 is negative and A,=(1+ V5 )/2 is posi- 
tive. Hence, x(t)=0, y(t)=nrz is unstable if n is even. When n is odd, both 
\, =(—1- V3 i)/2 and A, =(—1+ V3 i)/2 have negative real part. There- 
fore, the equilibrium solution x(t)=0, y()=nz is asymptotically stable if 
nis odd. 


EXERCISES 


Find all equilibrium solutions of each of the following systems of equa- 
tions and determine, if possible, whether they are stable or unstable. 


1. x=x—-—x?—xy? 2. x=x*+y?-1 3. x=x*+y?—-] 
y=2y—y?— yx" yex?—y? yo2xy 

4. X=6x —6x*—2xy 5. x=tan(x+y) 6. x=e”—x 
y=4y —4y2-2xy yextx? y=er*ty 


Verify that the origin is an equilibrium point of each of the following sys- 
tems of equations and determine, if possible, whether it is stable or unsta- 
ble. 


7. X=yt3x? 8. x=y+cosy—1 9. x=e*t¥—] 
y=x—3y? y=-sinx+x? y=sin(x + y) 

10. x=In(1+ x+y’) 11. x=cosy—sinx—1 
yer-ytx yox-y-y? 

12, x=8x-—3y+e"-1 13, x= —x—y—(x?+y?3/? 
y=sinx? —In(1— x—y) y=x—yt(x?+y7y/? 

14. x=x-y+2z? 15. x=e*trt7-] 
y=ytz—x? y=sin(x+y +z) 
z=z—-x+y" z=x-y-2z" 

16. x=In(1 — z) 17. x=x—cosy—ztl 
y=In(1— x) y=y—cosz—xt+l 
z=In(1—y) Z=z—cosx—yrtl 


18 (a) Find all equilibrium solutions of the system of differential equations 


Y__¢ 
dt atbx 


aX gy _ @ _ gy 
a © hx, ky, dt ey fz. 


(This system is a model for the control of protein synthesis.) 
(b) Determine the stability or instability of these solutions if either g, e, or c is 
zero. 
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4.4 The phase-plane 


In this section we begin our study of the “geometric” theory of differentia: 
equations. For simplicity, we will restrict ourselves, for the most part, tc 
the case n=2. Our aim is to obtain as complete a description as possible of 
all solutions of the system of differential equations 


& =f (x,y), of = 9(x,y). (1) 


To this end, observe that every solution x= x(t), y=y(Z) of (1) defines a 
curve in the three-dimensional space ¢,x,y. That is to say, the set of all 
points (¢,x(t),y(t)) describe a curve in the three-dimensional space ¢, x,y. 
For example, the solution x =cost, y=sint of the system of differential 
equations 


describes a helix (see Figure 1) in (t,x,y) space. 

The geometric theory of differential equations begins with the important 
observation that every solution x= x(t), y=y(t), tp) <t<t,, of (1) also de- 
fines a curve in the x—y plane. To wit, as ¢ runs from f) to ¢,, the set of 
points (x(t),y(4)) trace out a curve C in the x—y plane. This curve is 
called the orbit, or trajectory, of the solution x = x(f), y=y(t), and the x —y 
plane is called the phase-plane of the solutions of (1). Equivalently, we can 
think of the orbit of x(t), y(t) as the path that the solution traverses in the 
x—y plane. 


Figure 1. Graph of the solution x =cost, y=sint 


Example 1. It is easily verified that x =cost, y=sint¢ is a solution of the 
system of differential equations x = —y, y= x. As ¢ runs from 0 to 27, the 
set of points (cost,sin#) trace out the unit circle x?+y?=1 in the x—y 
plane. Hence, the unit circle x?+y?=1 is the orbit of the solution 
x=cost, y=sint, 0<t<27. As ¢ runs from 0 to oo, the set of points 
(cost,sin?¢) trace out this circle infinitely often. 
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Example 2. It is easily verified that x =e ‘cost, y=e ‘sint, —o0<t<oo, 
is a solution of the system of differential equations dx /dt=—x-—y, dy/dt 
=x-—y. As ¢t runs from —oo to oo, the set of points (e ‘cost,e ‘sin#) 
trace out a spiral in the x—y plane. Hence, the orbit of the solution 
x=e ‘cost, y=e ‘sint is the spiral shown in Figure 2. 


y 


Figure 2. Orbit of x=e~ ‘cost, y=e‘sint 


Example 3. It is easily verified that x =31+2, y=5t+7, —a<t<ooisa 
solution of the system of differential equations dx /dt=3, dy /dt=5. As t 
runs from — oo to oo, the set of points (3+2,5¢+7) trace out the straight 
line through the point (2,7) with slope 2. Hence, the orbit of the solution 
x=3t+2, y=5t+7 is the straight line y=2(x—2)+7, —0<x<oo. 


Example 4. It is easily verified that x =31?+2, y=52?+7, 0<t<o isa 
solution of the system of differential equations 


x =6[(y—-7)/5]'7, é =10{(x—2)/3]'”. 

All of the points (317 +2,51*+7) lie on the line through (2,7) with slope 2. 
However, x is always greater than or equal to 2, and y is always greater 
than or equal to 7. Hence, the orbit of the solution x =317+2, y=5t7+7, 
0<1<oo, is the straight line y=3(x—2)+7, 2<x<0o. 


Example 5. It is easily verified that x =31+2, y=51°+7, -o<t<oo, isa 
solution of the system of differential equations 


ax _ 


ax _ 3 2 DY _ 10, 
Ht y 9 (* 2) —4, (x—2). 


395 


4 Qualitative theory of differential equations 


The orbit of this solution is the set of all points (x,y)=(3¢+2,517+7). 
Solving for t= 5(x —2), we see that y = 2(x —2)?+7. Hence, the orbit of the 
solution x =31+2, y=5t?+7 is the parabola y = 2(x—2)°+7, |x|<0o. 


One of the advantages of considering the orbit of the solution rather 
than the solution itself is that is is often possible to obtain the orbit of a 
solution without prior knowledge of the solution. Let x = x(t), y=y(t) bea 
solution of (1). If x’(t) is unequal to zero at t=1,, then we can solve for 
t=t(x) in a neighborhood of the point x, = x(¢,) (see Exercise 4). Thus, for 
t near ¢,. the orbit of the solution x(¢), y(¢) is the curve y=y(t(x)). Next, 
observe that 


ee <i ————— —— 
— ome — 


dx dt dx dx/dt f(x,y) 


Thus, the orbits of the solutions x= x(t), y=y(t) of (1) are the solution 
curves of the first-order scalar equation 


dy 8(x,y) 
ae (2) 
x f(x,y) 
Therefore, it 1s not necessary to find a solution x(¢), y(2) of (1) in order to 


compute its orbit; we need only solve the single first-order scalar differen- 
tial equation (2). 


Remark. From now on, we will use the phrase “the orbits of (1)” to denote 
the totality of orbits of solutions of (1). 


Example 6. The orbits of the system of differential equations 


dx 2 dy 2 (3) 


are the solution curves of the scalar equation dy / dx = x?/y?. This equation 
is separable, and it is easily seen that every solution is of the form y(x)= 
(x? —c)!/3, c constant. Thus, the orbits of (3) are the set of all curves 
ya(x?—o)'”, 


Example 7. The orbits of the system of differential equations 


& a y(14x2+y?), O = ~2x(14x2-+y2) (4) 
are the solution curves of the scalar equation 

dy 2x(1+x*+y7) 2x 

dx (Itty?) 
This equation is separable, and all solutions are of the form $y?+x?=c’. 
Hence, the orbits of (4) are the families of ellipses $y? + x? = c?. 
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Warning. A solution curve of (2) is an orbit of (1) only if dx /dt and dy /dt 
are not zero simultaneously along the solution. If a solution curve of (2) 
passes through an equilibrium point of (1), then the entire solution curve is 
not an orbit. Rather, it is the union of several distinct orbits. For example, 
consider the system of differential equations 


deeyiivy), Fey 


dx dx/dt ~ y 


are the family of concentric circles x* + y?=c*. Observe, however, that ev- 
ery point on the unit circle x*+y?= 1 is an equilibrium point of (5). Thus, 
the orbits of this system are the circles x*+ y*=c?’, for c#1, and all points 
on the unit circle x?+y?=1. Similarly, the orbits of (3) are the curves y = 
(x? —c)'/, c#0; the half-lines y= x, x >0, and y= x, x <0; and the point 
(0,0). 

It is not possible, in general, to explicitly solve Equation (2). Hence, we 
cannot, in general, find the orbits of (1). Nevertheless, it is still possible to 
obtain an accurate description of all orbits of (1). This is because the sys- 
tem of differential equations (1) sets up a direction field in the x —y plane. 
That is to say, the system of differential equations (1) tells us how fast a 
solution moves along its orbit, and in what direction it is moving. More 
precisely, let x = x(t), y=y(t) be a solution of (1). As ¢ increases, the point 
(x(t),y(t)) moves along the orbit of this solution. Its velocity in the x-di- 
rection is dx /dt; its velocity in the y-direction is dy /dt; and the magnitude 
of its velocity is [((dx(t)/dt)’ +(dy(t)/dty’]'/*. But dx(t)/dt=f(x(t), y(O), 
and dy(t)/dt = g(x(t), y(t)). Hence, at each point (x,y) in the phase plane 
of (1) we know (i), the tangent to the orbit through (x,y) (the line through 
(x, y) with direction numbers f(x, y), g(x, y) respectively) and (11), the 
speed [ f2(x, y)+ g7(x, y)]!”* with which the solution is traversing its orbit. 
As we shall see in Sections 4.8—13, this information can often be used to 
deduce important properties of the orbits of (1). 

The notion of orbit can easily be extended to the case n >2. Let x=x(f) 
be a solution of the vector differential equation 


x} Fy (X1,--+5%,) 
x =f(x), X=/ : |, f(x) = (6) 


Xn f, (0), -0-5%,) 


on the interval tj <¢<7¢,. As ¢ runs from fy to ¢,, the set of points 
(x,(¢),...,x,(¢)) trace out a curve C in the n-dimensional space 
X1,Xz,...,X,. This curve is called the orbit of the solution x=x(?), for to < 
<¢,, and the n-dimensional space x,,...,x, is called the phase-space of the 
solutions of (6). 
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EXERCISES 


In each of Problems 1-3, verify that x(t), y(t) is a solution of the given sys- 
tem of equations, and find its orbit. 


1 %=1, y=2(1—x)sin(1— x) 
x()=1+t, y(t)=cost? 


x e*~—1 


2.xX=e *, yHe 
x(N=In(l+), yo=e' 

3. x=1+x7, y=(1+x7)sec?x 
x(f)=tant, y(t)=tan(tan 2) 


4. Suppose that x’(t;) +0. Show that we can solve the equation x= x(f) for t= 
t(x) in a neighborhood of the point x, = x(t,). Hint: If x’(t,) 40, then x(¢) is a 
strictly monotonic function of ¢ in a neighborhood of t=). 


Find the orbits of each of the following systems. 


5, x=y, 6. x=y(1+x?+y’), 
y=-x y= —x(14+x74+y7) 

7. x=y(1+x+y), 8. x=y+xy, 
y=—-x(1l+x+y) y=3xt xy? 

9. x= xye~3*, 10. x=4y, 

11. x=ax— bxy, 12. x=x?+cosy, 
y=cx — dxy y= dry 
(a,b,c,d positive) 

13. x=2xy, 14. x=y+sinx, 
yox?—y? y=x—ycosx 


4.5 Mathematical theories of war 


4.5.1. L. F. Richardson’s theory of conflict 


In this section we construct a mathematical model which describes the re- 
lation between two nations, each determined to defend itself against a pos- 
sible attack by the other. Each nation considers the possibility of attack 
quite real, and reasonably enough, bases its apprehensions on the readiness 
of the other to wage war. Our model is based on the work of Lewis Fry 
Richardson. It is not an attempt to make scientific statements about fore- 
ign politics or to predict the date at which the next war will break out. 
This, of course, is clearly impossible. Rather, it is a description of what 
people would do if they did not stop to think. As Richardson writes: “Why 
are sO many nations reluctantly but steadily increasing their armaments as 
if they were mechanically compelled to do so? Because, I say, they follow 
their traditions which are fixtures and their instincts which are mechanical; 
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and because they have not yet made a sufficiently strenuous intellectual 
and moral effort to control the situation. The process described by the en- 
suing equations is not to be thought of as inevitable. It is what would occur 
if instinct and tradition were allowed to act uncontrolled.” 

Let x = x(t) denote the war potential, or armaments, of the first nation, 
which we will call Jedesland, and let y(t) denote the war potential of the 
second nation, which we will call Andersland. The rate of change of x(¢) 
depends, obviously, on the war readiness y(t) of Andersland, and on the 
grievances that Jedesland feels towards Andersland. In the most simplistic 
model we represent these terms by ky and g respectively, where k and g 
are positive constants. These two terms cause x to increase. On the other 
hand, the cost of armaments has a restraining effect on dx /dt. We repre- 
sent this term by — ax, where a is a positive constant. A similar analysis 
holds for dy /dt. Consequently, x = x(t), y=y(d) is a solution of the linear 
system of differential equations 


dx _, | dq _ 
an’ ax + g, a By +h. (1) 


Remark. The model (1) is not limited to two nations; it can also represent 
the relation between two alliances. For example, Andersland and Jedes- 
land can represent the alliances of France with Russia, and Germany with 
Austria-Hungary during the years immediately prior to World War I. 


Throughout history, there has been a constant debate on the cause of 
war. Over two thousand years ago, Thucydides claimed that armaments 
cause war. In his account of the Peloponnesian war he writes: “The real 
though unavowed cause I believe to have been the growth of Athenian 
power, which terrified the Lacedaemonians and forced them into war.” Sir 
Edward Grey, the British Foreign Secretary during World War I agrees. 
He writes: “The increase of armaments that is intended in each nation to 
produce consciousness of strength, and a sense of security, does not pro- 
duce these effects. On the contrary, it produces a consciousness of the 
Strength of other nations and a sense of fear. The enormous growth of 
armaments in Europe, the sense of insecurity and fear caused by them—it 
was these that made war inevitable. This is the real and final account of 
the origin of the Great War.” 

On the other hand, L. S. Amery, a member of Britain’s parliament 
during the 1930’s vehemently disagrees. When the opinion of Sir Edward 
Grey was quoted in the House of Commons, Amery replied: “With all due 
respect to the memory of an eminent statesman, I believe that statement to 
be entirely mistaken. The armaments were only the symptoms of the con- 
flict of ambitions and ideals, of those nationalist forces which created the 
War. The War was brought about because Serbia, Italy and Rumania 
passionately desired the incorporation in their states of territories which at 
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that time belonged to the Austrian Empire and which the Austrian govern- 
ment was not prepared to abandon without a struggle. France was pre- 
pared, if the opportunity ever came, to make an effort to recover Alsace— 
Lorraine. It was in those facts, in those insoluble conflicts of ambitions, 
and not in the armaments themselves, that the cause of the War lay.” 

The system of equations (1) takes both conflicting theories into account. 
Thucydides and Sir Edward Grey would take g and h small compared to k 
and /, while Mr. Amery would take k and / small compared to g and h. 

The system of equations (1) has several important implications. Suppose 
that g and h are both zero. Then, x(¢)=0, y(t)=0 is an equilibrium solu- 
tion of (1). That is, if x, y, g, and h are all made zero simultaneously, then 
x(t) and y(t) will always remain zero. This ideal condition is permanent 
peace by disarmament and satisfaction. It has existed since 1817 on the 
border between Canada and the United States, and since 1905 on the 
border between Norway and Sweden. 

These equations further imply that mutual disarmament without satis- 
faction is not permanent. Assume that x and y vanish simultaneously at 
some time f= fp. At this time, dx/dt=g and dy /dt=h. Thus, x and y will 
not remain zero if g and h are positive. Instead, both nations will rearm. 

Unilateral disarmament corresponds to setting y =O at a certain instant 
of time. At this time, dy/dt=Ilx+h. This implies that y will not remain 
zero if either h or x is positive. Thus, unilateral disarmament is never per- 
manent. This accords with the historical fact that Germany, whose army 
was reduced by the Treaty of Versailles to 100,000 men, a level far below 
that of several of her neighbors, insisting on rearming during the years 
1933-36. 

A race in armaments occurs when the “defense” terms predominate in 
(1). In this case, 


dx _ dy _ 
a gy (2) 


Every solution of (2) is of the form 


x(t) = AeV! "+ BeW kt yQ=Vz [ AeVH!#— Be~ Ve +), 


Therefore, both x(t) and y(¢) approach infinity if A is positive. This infin- 
ity can be interpreted as war. 

Now, the system of equations (1) is not quite correct, since it does not 
take into effect the cooperation, or trade, between Andersland and Jedes- 
land. As we see today, mutual cocperation between nations tends to de- 
crease their fears and suspicions. We correct our model by changing the 
meaning of x(t) and y(t); we let the variables x(¢) and y(?f) stand for 
“threats” minus “cooperation.” Specifically, we set x = U-— U, and y= 
V—Vpo, where U is the defense budget of Jedesland, V is the defense 
budget of Andersland, U, is the amount of goods exported by Jedesland to 


400 


4.5 Mathematical theories of war 


Andersland and V, is the amount of goods exported by Andersland to 

Jedesland. Observe that cooperation evokes reciprocal cooperation, just as 

armaments provoke more armaments. In addition, nations have a tendency 

to reduce cooperation on account of the expense which it involves. Thus, 

the system of equations (1) still describes this more general state of affairs. 
The system of equations (1) has a single equilibrium solution 


_ kh+ bg _ _ Igt+ah 3 
~ aB—-kl’? ~*~ %° aB—kl (3) 


if a8 —ki#0. We are interested in determining whether this equilibrium 
solution is stable or unstable. To this end, we write (1) in the form w= Aw 
+f, where 


wo (A) f=( 7) and a=(~4 hy 


The equilibrium solution is 


X= Xo 


Xo 
w=w=(5°), 


where Aw, +f=0. Setting z=Ww —w,, we obtain that 
z=w=Awtf=A(z+w,)+f=Az+ Awyt+ f= Az. 


Clearly, the equilibrium solution w(¢)=w, of w=Aw+f is stable if, and 
only if, z=0 is a stable solution of z= Az. To determine the stability of z= 
0 we compute 


p(d)=det( Tan; _ fy) TNH (at BYA+ B= ke 


The roots of p(A) are 


lat B)* [Cat BY AaB KI)” 


2 
5 1/2 
—(a+B)+|(a-B)+4ki] 
=? 

Notice that both roots are real and unequal to zero. Moreover, both roots 
are negative if a8 — kl >0, and one root is positive if aB — ki<0. Thus, z(t) 
=0, and consequently the equilibrium solution x(t)= Xp, y()= yo is stable 
if a8 —-kl >0 and unstable if aB— k/<0. 

Let us now tackle the difficult problem of estimating the coefficients a, 
B, k, 1, g, and h. There is no way, obviously, of measuring g and h. How- 
ever, it is possible to obtain reasonable estimates for a, B, k, and /. Observe 
that the units of these coefficients are reciprocal times. Physicists and en- 
gineers would call a~' and B ~! relaxation times, for if y and g were identi- 
cally zero, then x(t)=e~*"~x(t)). This implies that x(tj+a7')= 
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x(t ))/e. Hence, a~' is the time required for Jedesland’s armaments to be 
reduced in the ratio 2.718 if that nation has no grievances and no other na- 
tion has any armaments. Richardson estimates a~! to be the lifetime of 
Jedesland’s parliament. Thus, a =0.2 for Great Britain, since the lifetime of 
Britain’s parliament is five years. 

To estimate k and / we take a hypothetical case in which g=0 and y= 
y,, So that dx /dt=ky,—ax. When x=0, 1/k=y,/(dx/dt). Thus, 1/k is 
the time required for Jedesland to catch up to Andersland provided that (i) 
Andersland’s armaments remain constant, (ii) there are no grievances, and 
(111) the cost of armaments doesn’t slow Jedesland down. Consider now the 
German rearmament during 1933-36. Germany started with nearly zero 
armaments and caught up with her neighbors in about three years. Assum- 
ing that the slowing effect of a nearly balanced the Germans’ very strong 
grievances g, we take k=0.3 (year)~' for Germany. Further, we observe 
that & is obviously proportional to the amount of industry that a nation 
has. Thus, k =0.15 for a nation which has only half the industrial capacity 
of Germany, and k=0.9 for a nation which has three times the industrial 
capacity of Germany. 

Let us now check our model against the European arms race of 
1909-1914. France was allied with Russia, and Germany was allied with 
Austria-Hungary. Neither Italy or Britain was in a definite alliance with 
either party. Thus, let Jedesland represent the alliance of France with 
Russia, and let Andersland represent the alliance of Germany with 
Austria-Hungary. Since these two alliances were roughly equal in size we 
take k=/, and since each alliance was roughly three times the size of 
Germany, we take k=/=0.9. We also assume that a= 8 =0.2. Then, 


o = axthy +g, @ a kx—ay +h (4) 
Equation (4) has a unique equilibrium point 
_ kh+ag _ kg+ah 
nO 2 ke? %o™ a2 — k2- 


This equilibrium is unstable since 
aB — kl=a?—k*=0.04—0.81 = —0.77. 


This, of course, is in agreement with the historical fact that these two alli- 
ances went to war with each other. 

Now, the model we have constructed is very crude since it assumes that 
the grievances g and h are constant in time. This is obviously not true. The 
grievances g and h are not even continuous functions of time since they 
jump instantaneously by large amounts. (It’s safe to assume, though, that g 
and h are relatively constant over long periods of time.) In spite of this, the 
system of equations (4) still provides a very accurate description of the 
arms race preceding World War I. To demonstrate this, we add the two 
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equations of (4) together, to obtain that 
d 
UX tyyalk—aylxty) tata. (5) 


Recall that x= U— U, and y=V—V,, where U and V are the defense 
budgets of the two alliances, and U,) and V, are the amount of goods ex- 
ported from each alliance to the other. Hence, 


<(U+ v)=(k-a)| U+ V—|Uo+ Vo- get tf (uy r»)]|. 


The defense budgets for the two alliances are set out in Table I. 


Table 1. Defense budgets expressed in millions of £ ster- 


ling 

1909 1910 1911 1912 1913 
France 50.9 57.1 63.2 74.7 
Russia 66.7 68.5 70.7 81.8 92.0 
Germany 63.1 62.0 62.5 68.2 95.4 
Austria-Hungary 20.8 23.4 24.6 25.5 26.9 
Total U+V 199.2 204.8 214.9 238.7 289.0 

A(U+ V) 5.6 10.1 23.8 50.3 


U+V at same date 202.0 209.8 226.8 263.8 


In Figure 1 we plot the annual increment of U+ V against the average 
of U+V for the two years used in forming the increment. Notice how 
close these four points, denoted by o, are to the straight line 


A(U+ V)=0.73(U+ V—194). (7) 


Thus, foreign politics does indeed have a machine-like predictability. 
Equations (6) and (7) imply that 


gth=(k—a)(U)+ V,)—A(U 5+ Vo) — 194 


and k—a=0.73. This is in excellent agreement with Richardson’s esti- 
mates of 0.9 for k and 0.2 for a. Finally, observe from (7) that the total de- 
fense budgets of the two alliances will increase if U+ V is greater than 194 
million, and will decrease otherwise. In actual fact, the defense expendi- 
tures of the two alliances was 199.2 million in 1909 while the trade between 
the two alliances amounted to only 171.8 million. Thus began an arms race 
which led eventually to World War I. 
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Figure 1. Graph of A(U+ V) versus U+ V 


Reference 


Richardson, L. F., “Generalized foreign politics,” The British Journal of Psychol- 
ogy, monograph supplement #23, 1939. 


EXERCISES 
1. Suppose that what moves a government to arm is not the magnitude of other na- 
tions’ armaments, but the difference between its own and theirs. Then, 
dx dy 
i ky x)-axtg, HAIG y)— By +h. 


Show that every solution of this system of equations is stable if k,/,;<(a,+ 
k,)( B, + li) and unstable if kyl, > (a, + kB + l,). 


2. Consider the case of three nations, each having the same defense coefficient k 
and the same restraint coefficient a. Then, 


Ow axthytke+gy 


dy 
a ay+kz+ 2) 


© a kxtky— 02+ 83 


x —a k k 81 
u-(9), A= kK -a k |, and g=| &2 |, 
Z k kK -a@ 83 


we see that u=Au+g. 

(a) Show that p(A)=det(A —AD = —(at+A)>+3k7(a +A) +2k?. 

(b) Show that p(A)=0 when A= —a-—k. Use this information to find the re- 
maining two roots of p(A). 


Setting 
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(c) Show that every solution u(¢) is stable if 2k <a, and unstable if 2k >a. 


3. Suppose in Problem 2 that the z nation is a pacifist nation, while x and y are 
pugnacious nations. Then, 


& = —axtky+kz+g, 
Ow kx— ay + ke + £5 (*) 
E =0-x+0-y—a0z +85, 


Show that every solution x(t),y(2),z(t) of (*) is stable if k<a, and unstable if 
k>a. 


4.5.2 Lanchester’s combat models and the battle of Iwo Jima 


During the first World War, F. W. Lanchester [4] pointed out the impor- 
tance of the concentration of troops in modern combat. He constructed 
mathematical models from which the expected results of an engagement 
could be obtained. In this section we will derive two of these models, that 
of a conventional force versus a conventional force, and that of a conven- 
tional force versus a guerilla force. We will then solve these models, or 
equations, and derive “Lanchester’s square law,” which states that the 
strength of a combat force is proportional to the square of the number of 
combatants entering the engagement. Finally, we will fit one of these mod- 
els, with astonishing accuracy, to the battle of Iwo Jima in World War II. 


(a) Construction of the models 


Suppose that an “x-force” and a “y-force” are engaged in combat. For 
simplicity, we define the strengths of these two forces as their number of 
combatants. (See Howes and Thrall [3] for another definition of combat 
strength.) Thus let x(t) and y(t) denote the number of combatants of the x 
and y forces, where ¢ is measured in days from the start of the combat. 
Clearly, the rate of change of each of these quantities equals its reinforce- 
ment rate minus its operational loss rate minus its combat loss rate. 


The operational loss rate. The operational loss rate of a combat force is its 
loss rate due to non-combat mishaps; i.e., desertions, diseases, etc. 
Lanchester proposed that the operational loss rate of a combat force is 
proportional to its strength. However, this does not appear to be very real- 
istic. For example, the desertion rate in a combat force depends on a host 
of psychological and other intangible factors which are difficult even to de- 
scribe, let alone quantify. We will take the easy way out here and consider 
only those engagements in which the operational loss rates are negligible. 


The combat loss rate. Suppose that the x-force is a conventional force 
which operates in the open, comparatively speaking, and that every mem- 
ber of this force is within “kill range” of the enemy y. We also assume that 
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as soon as the conventional force suffers a loss, fire is concentrated on the 
remaining combatants. Under these “ideal” conditions, the combat loss 
rate of a conventional force x equals ay(t), for some positive constant a. 
This constant is called the combat effectiveness coefficient of the y-force. 

The situation is very different if x is a guerilla force, invisible to its 
opponent y and occupying a region R. The y-force fires into R but cannot 
know when a kill has been made. It is certainly plausible that the combat 
loss rate for a guerilla force x should be proportional to x(¢), for the larger 
x(t), the greater the probability that an opponent’s shot will kill. On the 
other hand, the combat loss rate for x is also proportional to y(#), for the 
larger y, the greater the number of x-casualties. Thus, the combat loss rate 
for a guerilla force x equals cx(t)y(t), where the constant c is called the 
combat effectiveness coefficient of the opponent y. 


The reinforcement rate. The reinforcement rate of a combat force is the 
rate at which new combatants enter (or are withdrawn from) the battle. We 
denote the reinforcement rates of the x- and y-forces by f(t) and g(t) re- 
spectively. 

Under the assumptions listed above, we can now write down the follow- 
ing two Lanchestrian models for conventional—guerilla combat. 


d. 
Conventional combat: (1a) 
w __ bx + g(t) 
dt 
ax 
Conventional—guerilla combat: a °° HH) (1b) 
(x = guerilla) & = —dx+9(t) 


The system of equations (la) is a linear system and can be solved explicitly 
once a, b, f(t), and g(t) are known. On the other hand, the system of equa- 
tions (1b) is nonlinear, and its solution is much more difficult. (Indeed, it 
can only be obtained with the aid of a digital computer.) 

It is very instructive to consider the special case where the reinforce- 
ment rates are zero. This situation occurs when the two forces are “iso- 
lated.” In this case (1a) and (1b) reduce to the simpler systems 


Ho!) ae bx (2a) 
and 

dx gd _ 

po OW He dx. (2b) 
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Conventional combat: The square law. The orbits of system (2a) are the 
solution curves of the equation 


Y _ bx 


or ayo. = bx. 
dx ay dx 


Integrating this equation gives 
ay? — bx? = ay2— bx2= K. (3) 


The curves (3) define a family of hyperbolas in the x—y plane and we have 
indicated their graphs in Figure 1. The arrowheads on the curves indicate 
the direction of changing strengths as time passes. 

Let us adopt the criterion that one force wins the battle if the other 
force vanishes first. Then, y wins if K >0O since the x-force has been 


annihilated by the time y(t) has decreased to VK ja. Similarly, x wins if 
K<0. 


y(t) 


K>O: y wins 


VK/a K<O. x wins 


V-—K /b x(t) 
Figure 1. The hyperbolas defined by (3) 


Remark 1. Equation (3) is often referred to as “Lanchester’s square law,” 
and the system (2a) is often called the square law model, since the 
strengths of the opposing forces appear quadratically in (3). This terminol- 
ogy is rather anomolous since the system (2a) is actually a linear system. 


Remark 2. The y-force always seeks to establish a setting in which K >0. 
That is to say, the y-force wants the inequality 


ay, > bx? 


to hold. This can be accomplished by increasing a; i.e. by using stronger 
and more accurate weapons, or by increasing the initial force yo. Notice 
though that a doubling of a results in a doubling of ayé while a doubling of 
Yo results in a four-fold increase of ayg. This is the essence of Lanchester’s 
square law of conventional combat. 
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Conventional-guerilla combat. The orbits of system (2b) are the solution 
curves of the equation 


y dx id m 


Multiplying both sides of (4) by cy and integrating gives 

cy*—2dx =cyZ—2dx,)=M. (5) 
The curves (5) define a family of parabolas in the x—y plane, and we have 
indicated their graphs in Figure 2. The y-force wins if M >0, since the x- 


force has been annihilated by the time y(t) has decreased to V M/c . Simi- 
larly, x wins if M <0. 


y (t) 
M>O: y wins 


M=0: tie 


M<O: x wins 


x(t) 
Figure 2. The parabolas defined by (5) 


Remark. It is usually impossible to determine, a priori, the numerical value 
of the combat coefficients a, b, c, and d. Thus, it would appear that 
Lanchester’s combat models have little or no applicability to real-life en- 
gagements. However, this is not so. As we shall soon see, it is often possi- 
ble to determine suitable values of a and b (or c and d) using data from the 
battle itself. Once these values are established for one engagement, they are 
known for all other engagements which are fought under similar condi- 
tions. 


(b) The battle of Iwo Jima 


One of the fiercest battles of World War II was fought on the island of Iwo 
Jima, 660 miles south of Tokyo. Our forces coveted Iwo Jima as a bomber 
base close to the Japanese mainland, while the Japanese needed the island 
as a base for fighter planes attacking our aircraft on their way to bombing 
missions over Tokyo and other major Japanese cities. The American inva- 
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sion of Iwo Jima began on February 19, 1945, and the fighting was intense 
throughout the month long combat. Both sides suffered heavy casualties 
(see Table 1). The Japanese had been ordered to fight to the last man, and 
this is exactly what they did. The island was declared “secure” by the 
American forces on the 28th day of the battle, and all active combat 
ceased on the 36th day. (The last two Japanese holdouts surrendered in 
1951!) 


Table 1. Casualties at Iwo Jima 


Total United States casualties at Iwo Jima 
Killed, missing or Wounded Combat Fatigue Total 
died of wounds 
Marines 5,931 2,648 25,851 
Navy units: 
Ships and air units 633 
Medical corpsmen 195 
Seabees 51 
Doctors and dentists 2 
Army units in battle 9 
Grand totals 6,821 19,217 2,648 
Japanese casualties at Iwo Jima 
Defense forces Prisoners 
(Estimated) 
21,000 Marine 216 
Army 867 
Total 1,083 


(Newcomb [6], page 296) 


The following data is available to us from the battle of Iwo Jima. 

1. Reinforcement rates. During the conflict Japanese troops were 
neither withdrawn nor reinforced. The Americans, on the other hand, 
landed 54,000 troops on the first day of the battle, none on the second, 
6,000 on the third, none on the fourth and fifth, 13,000 on the sixth day, 
and none thereafter. There were no American troops on Iwo Jima prior to 
the start of the engagement. 

2. Combat losses. Captain Clifford Morehouse of the United States 
Marine Corps (see Morehouse [5]) kept a daily count of all American com- 
bat losses. Unfortunately, no such records are available for the Japanese 
forces. Most probably, the casualty lists kept by General Kuribayashi 
(commander of the Japanese forces on Iwo Jima) were destroyed in the 
battle itself, while whatever records were kept in Tokyo were consumed in 
the fire bombings of the remaining five months of the war. However, we 
can infer from Table 1 that approximately 21,500 Japanese forces were on 
Iwo Jima at the start of the battle. (Actually, Newcomb arrived at the fig- 
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ure of 21,000 for the Japanese forces, but this is a little low since he appar- 
ently did not include some of the living and dead found in the caves in the 
final days.) 

3. Operational losses. The operational losses on both sides were negligi- 
ble. 

Now, let x(t) and y(t) denote respectively, the active American and 
Japanese forces on Iwo Jima rt days after the battle began. The data above 
suggests the following Lanchestrian model for the battle of Iwo Jima: 


dx _ 


Ho Mth) 

(6) 
y = — bx 
at 


where a and b are the combat effectiveness coefficients of the Japanese 
and American forces, respectively, and 


54,000 O<t<l 
0 l<t<2 
6,000 2<1t<3 
t)=4 ™ 
F(Z) 0 3<t<5 
13,000 5<t<6 
0 t>6 


Using the method of variation of parameters developed in Section 3.12 or 
the method of elimination in Section 2.14, it is easily seen that the solution 
of (6) which satisfies x(0)=0, y(0)=y)= 21,500 is given by 


“(j= yocoshV ab t+ i ‘cosh Vab (t—s)f(s)ds (7a) 


and 


a 


y(t)=y,cosh Vab -\/? ['sinhVab (1-5) f(s)ds (7) 
0 
where 
coshx=(e*+e~*)/2 and sinhx=(e*—e *)/2. 


The problem before us now is this: Do there exist constants a and 5 so 
that (7a) yields a good fit to the data compiled by Morehouse? This is an 
extremely important question. An affirmative answer would indicate that 
Lanchestrian models do indeed describe real life battles, while a negative 
answer would shed a dubious light on much of Lanchester’s work. 

As we mentioned previously, it is extremely difficult to compute the 
combat effectiveness coefficients a and b of two opposing forces. However, 
it is often possible to determine suitable values of a and b once the data for 
the battle is known, and such is the case for the battle of Iwo Jima. 
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The calculation of a and.b. Integrating the second equation of (6) between 
0 and s gives 


y(s)—yo= —b f “x(t) dt 


so that 
—y(s 
b= “ y(s) (8) 
[ x(ae 
0 
In particular, setting s=36 gives 
Yo ¥(36) 21,500 
= (9) 


J * x(t) dt J * e(t) dt | 


Now the integral on the right-hand side of (9) can be approximated by the 
Riemann sum 
36 


[ox@a= > x(i) 
0 i=1 


and for x(i) we enter the number of effective American troops on the /th 
day of the battle. Using the data available from Morehouse, we compute 
for b the value 


21,500 


b= 2,037,000 


= (0.0106. (10) 


Remark. We would prefer to set s=28 in (8) since that was the day the is- 
land was declared secure, and the fighting was only sporadic after this day. 
However, we don’t know y(28). Thus, we are forced here to take s =36. 


Next, we integrate the first equation of (6) between t=0 and ¢=28 and 
obtain that 


28 28 
x(28)=—af y(iart f f(tjat 
28 
=-af y(t)dt+73,000. 


There were 52,735 effective American troops on the 28th day of the battle. 
Thus 


a= 73,000 — 52,735 _ 20,265 


28 28 ; (11) 
[ova [oat 
0 0 
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Finally, we approximate the integral on the right-hand side of (11) by the 
Riemann sum 


28 28 
[ rae vV 
0 j=l 
and we approximate y(/) by 
y(j)=Vo~b [ "x (a) ae 
0 


J 
=21,500~b > x(i). 


Again, we replace x(i) by the number of effective American troops on the 
ith day of the battle. The result of this calculation is (see Engel [2]) 


x 20,265 
372,500 


= 0.0544. (12) 


Figure 3 below compares the actual American troop strength with the val- 
ues predicted by Equation (7a) (with a=0.0544 and b=0.0106). The fit is 
remarkably good. Thus, it appears that a Lanchestrian model does indeed 
describe real life engagements. 

74,000 
70,000 


66,000 


62,000 —--- ACTUAL 


~ PREDICTED 
98,000 
TROOPS 


AMERICAN FORCES 


54,000 


50 ,000 
4 8 12 16 20 24 28 32 36 


DAYS 


Figure 3. Comparison of actual troop strength with predicted troop strength 


Remark. The figures we have used for American reinforcements include 
all the personnel put ashore, both combat troops and support troops. Thus 
the numbers a and b that we have computed should be interpreted as the 
average effectiveness per man ashore. 
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EXERCISES 
1. Derive Equations (7a) and (7b). 
2. The system of equations 
eT ® (13) 
y= — by — exy 


is a Lanchestrian model for conventional-guerilla combat, in which the opera- 
tional loss rate of the guerilla force y is proportional to y(7). 

(a) Find the orbits of (13). 

(b) Who wins the battle? 


. The system of equations 


x=-—a 

oe (14) 
y=—bx-cxy 

is a Lanchestrian model for conventional—guerilla combat, in which the opera- 

tional loss rate of the guerilla force y is proportional to the strength of the con- 

ventional force x. Find the orbits of (14). 


. The system of equations 


x =—cxy 
15 
a (15) 

is a Lanchestrian model for guerilla—guerilla combat in which the operational 

loss rates are negligible. 

(a) Find the orbits of (15). 

(b) Who wins the battle? 


. The system of equations 


x= —ay—cxy (16) 
y= — bx—dxy 
is a Lanchestrian model for guerilla—guerilla combat in which the operational 


loss rate of each force is proportional to the strength of its opponent. Find the 
orbits of (16). 


413 


4 Qualitative theory of differential equations 


6. The system of equations 


X= —ax—cxy (17) 
y=—by—dxy 

is a Lanchestrian model for guerilla—guerilla combat in which the operational 
loss rate of each force is proportional to its strength. 
(a) Find the orbits of (17). 
(b) Show that the x and y axes are both orbits of (17). 
(c) Using the fact (to be proved in Section 4.6) that two orbits of (17) cannot 
intersect, show that there is no clear-cut winner in this battle. Hint: Show that 
x(t) and y(¢) can never become zero in finite time. (Using lemmas | and 2 of 
Section 4.8, it is easy to show that both x(t) and y(t) approach zero as t->00.) 


4.6 Qualitative properties of orbits 


In this section we will derive two very important properties of the solutions 
and orbits of the system of differential equations 


xy Fi (1, -++5 Xp) 
x=f(x), x=]: |, ~~ f(x)= . (1) 


fie per%y) 


The first property deals with the existence and uniqueness of orbits, and 
the second property deals with the existence of periodic solutions of (1). 
We begin with the following existence-uniqueness theorem for the solu- 
tions of (1). 


Theorem 3. Let each of the functions f\(X),.-+5X%_)>+++oSy(Xq>+++sX,_) Nave con- 
tinuous partial derivatives with respect to x,,...,X,. Then, the initial-value 
problem x={(x), X(to)=x° has one, and only one solution x=x(t), for ev- 
ery x° in R". 


We prove Theorem 3 in exactly the same manner as we proved the ex- 
istence-uniqueness theorem for the scalar differential equation x =f (t,x). 
Indeed, the proof given in Section 1.10 carries over here word for word. 
We need only interpret the quantity |x(7)—y(s)|, where x(t) and y() are 
vector-valued functions, as the length of the vector x(t)—y(¢). That is to 
say, if we interpret |x(7¢)—y(z)| as 


Ix(¢)—y(4)| =max{|x, (2) (4).  e) —ya(D1}, 
then the proof of Theorem 2, Section 1.10 is valid even for vector-valued 
functions f(¢,x) (see Exercises 13-14). 
Next, we require the following simple but extremely useful lemma. 


Lemma 1. Jf x=¢(12) is a solution of (1), then x=(t + c) is again a solution 


of (1). 
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The meaning of Lemma 1 is the following. Let x=(7) be a solution of 
(1) and let us replace every ¢ in the formula for $(t) by ¢+c. In this 
manner we obtain a new function x(t)=(t+c). Lemma | states that xX(¢) 
is again a solution of (1). For example, x,= tant, x,=sec’t is a solution of 
the system of differential equations dx,/dt=x,, dx,/dt=2x,x,. Hence, 
x,=tan(t+c), x,=sec?(t+c) is again a solution, for any constant c. 


PROOF OF LEMMA |. If x=@(f) is a solution of (1), then do(t)/dt=f($(4)); 
that is, the two functions d(t)/dt and h(t)=f(¢(4)) agree at every single 
time. Fix a time ¢ and a constant c. Since d/dt and h agree at every time, 
they must agree at time +c. Hence, 


Ort c)=h(t+c)=f(6(t+c)). 


But, dp/dt evaluated at ¢+c equals the derivative of x(t)=¢(t+ c), 
evaluated at ¢. Therefore, 


d 
Tete )=Ho(t+ c)). O 


Remark 1. Lemma | can be verified explicitly for the linear equation x= 
Ax. Every solution x(f) of this equation is of the form x(t)=e*“‘v, for some 
constant vector v. Hence, 


x(t +c) = eAUt y= eAeAcy 


since (At)Ac = Ac(Apd) for all values of ¢ and c. Therefore, x(¢+ cc) 1s again 

a solution of x=Ax since it is of the form e“” times the constant vector 
Ac 

e Vv. 


Remark 2. Lemma | is not true if the function f in (1) depends explicitly 
on ¢. To see this, suppose that x= @(f) is a solution of the nonautonomous 
differential equation x=f(t,x). Then, $(¢+c)=f(t+ c,@(t+c)). Conse- 
quently, the function x=@(t+c) satisfies the differential equation 


k=f(t+c,x), 


and this equation is different from (1) if f depends explicitly on ¢. 


We are now in a position to derive the following extremely important 
properties of the solutions and orbits of (1). 


Property 1. (Existence and uniqueness of orbits.) Let each of the functions 
F(X), +++ Xp_)>-+ +t (X15--+,X,,) have continuous partial derivatives with re- 
spect to x,,...,x,. Then, there exists one, and only one, orbit through 
every point x° in R”. In particular, if the orbits of two solutions x = (1) 
and x= y(¢) of (1) have one point in common, then they must be identi- 
cal. 
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Property 2. Let x =¢(f) be a solution of (1). If 6(¢9+ T)=$(4,) for some fy 
and T >0, then (t+ T) is identically equal to @(¢). In other words, if a 
solution x(¢) of (1) returns to its starting value after a time T >0, then it 
must be periodic, with period T (i.e. it must repeat itself over every time 
interval of length T.) 


PROOF OF PROPERTY 1. Let x° be any point in the n-dimensional phase 
space xX,,...,x,, and let x=@(t) be the solution of the initial-value problem 
X= f(x), x(0)=x°. The orbit of this solution obviously passes through x®. 
Hence, there exists at least one orbit through every point x°. Now, suppose 
that the orbit of another solution x= y(t) also passes through x°. This 
means that there exists f9(40) such that ¥(t9)=x°. By Lemma 1, 


x=yY(t+f)) 
is also a solution of (1). Observe that ¥(t+ 7%) and (7) have the same 
value at t=0. Hence, by Theorem 3, ¥(t+¢)) equals (¢) for all time ¢. 
This implies that the orbits of @(¢) and y(Z) are identical. To wit, if € is a 
point on the orbit of (7); that is, €=(t,) for some ¢,, then € is also on the 
orbit of Y(2), since €=(t,)= Y(t, + to). Conversely, if € is a point on the 
orbit of y(t); that is, there exists ¢, such that y(t,)=&, then € is also on the 
orbit of $(¢) since €= P(t,) = (1, — fo). O 


PROOF OF PROPERTY 2. Let x=@(t) be a solution of (1) and suppose that 
6(t9 + T)=(t) for some numbers ¢, and 7. Then, the function Y(4)= 
(t+ T) is also a solution of (1) which agrees with $(f) at time t=. By 
Theorem 3, therefore, ¥(¢)=(t+ T) is identically equal to (f). O 


Property 2 is extremely useful in applications, especially when n=2. Let 
x= x(t), y=y(t) be a periodic solution of the system of differential equa- 
tions 

& oj(xy),  Fag(xy) (2) 
If x(¢+7)=x(t) and y(t+ T)=y(d), then the orbit of this solution is a 
closed curve C in the x-y plane. In every time interval tj<t<%)+T, the 
solution moves once around C. Conversely, suppose that the orbit of a 
solution x = x(t), y=y(t) of (2) 1s a closed curve containing no equilibrium 
points of (2). Then, the solution x = x(t), y= y(t) is periodic. To prove this, 
recall that a solution x= x(t), y=y(t) of (2) moves along its orbit with 
velocity [ f7(x,v) + 27(x,y)]'/’. If its orbit C is a closed curve containing no 
equilibrium points of (2), then the function [ f?(x,y)+27(x,y)]'/ has a 
positive minimum for (x,y) on C. Hence, the orbit of x= x(t), y=y(d) 
must return to its starting point x>= x(t), Yo = (to) in some finite time T. 
But this implies that x(¢+ T)= x(t) and y(t+ T)=y (2) for all ¢. 


Example 1. Prove that every solution z(t) of the second-order differential 
equation (dz /dt*)+ z+2z°=0 is periodic. 
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ProoF. We convert this second-order equation into a system of two first- 
order equations by setting x =z, y=dz/dt. Then, 


ax _ Y __ x3 
i” i re (3) 
The orbits of (3) are the solution curves 
2 2 4 
YX yg X 22 
5 + 5 + 4 76 (4) 


of the scalar equation dy /dx = —(x + x°)/y. Equation (4) defines a closed 
curve in the x—y plane (see Exercise 7). Moreover, the only equilibrium 
point of (3) is x =0, y=0. Consequently, every solution x = z(t), y=z’(t) of 
(3) is a periodic function of time. Notice, however, that we cannot compute 
the period of any particular solution. O 


Example 2. Prove that every solution of the system of differential equa- 
tions ay 

a =yeltx?ty” 7 _ — xeltxry’ (5) 
is periodic. 
Solution. The orbits of (5) are the solution curves x?+y?=c? of the first- 
order scalar equation dy /dx = — x/y. Moreover, x =0, y=0O is the only 
equilibrium point of (5). Consequently, every solution x= x(t), y=y(Z) of 
(5) is a periodic function of time. 


EXERCISES 


1. Show that all solutions x(7), y(t) of 


ax _ 2 . dy 
a * +ysinx, a 1+ xy +cosy 


which start in the first quadrant (x >0, y >0) must remain there for all time 
(both backwards and forwards). 


2. Show that all solutions x(t), y(t) of 


dx _ x dy y 
a 7 le 1), i =x+t+e 


which start in the right half plane (x >0) must remain there for all time. 


3. Show that all solutions x(t), y(¢) of 
dx _ 24,2 dy _ 
i l+x‘+y*, i xy + tany 


which start in the upper half plane (y >0) must remain there for all time. 


4. Show that all solutions x(7), y(2) of 

ax —_4_ 2 dy _ 

= l-yt+x’*, 77 =x+xy 

which start inside the unit circle x*+y?=1 must remain there for all time. 
Hint: Compute d(x?+y7)/dt. 


417 


4 Qualitative theory of differential equations 


5. Let x(2), y(2) be a solution of 


BH my t x2, Y _xty 


with x(to)# y(t). Show that x(7) can never equal y(t). 


6. Can a figure 8 ever be an orbit of 


dx dy 
a TL (ey) a Be) 
where f and g have continuous partial derivatives with respect to x and y? 


7. Show that the curve y?+ x?+ x4/2=2c? is closed. Hint: Show that there exist 
two points y =0, x = ta which lie on this curve. 


Prove that all solutions of the following second-order equations are peri- 
odic. 


2 2 
g, 42 473=0 9, 42 47475=0 
dt? dt? 
2 2 
10, 2 467=1 1, 4244 =0 
dt? dt? 1+z 
12. Show that all solutions z(¢) of 
2 
a2 4 7-273=0 
dt? 


are periodic if 27(0)+ z7(0) — z4*(0) <4, and unbounded if 
27(0)+2z7(0)—z*(0) >}. 


13. (a) Let 
L=nX max |af,/dx;|, for |x—x|<b. 
ij= n 


Show that |f(x) —f(y)| < L|x—y]| if |x—x®| <b and |y—x°| <b. 
(b) Let M=max(|f(x)| for |\x—x°| < b. Show that the Picard iterates 


Xj 41(t)=x°+ J 'f(x,(s))ds,  xo(t) =x° 


converge to a solution x(f) of the initial-value problem x=f(x), x(¢) = x? on 
the interval |t—t)|<b/M. Hint: The proof of Theorem 2, Section 1.10 
carries over here word for word. 


14. Compute the Picard iterates x,(¢) of the initial-value problem x= Ax, x(0) =x°, 
and verify that they approach e4‘x° as j approaches infinity. 


4.7 Phase portraits of linear systems 


In this section we present a complete picture of all orbits of the linear dif- 
ferential equation 


em (3) (EB 
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This picture is called a phase portrait, and it depends almost completely on 
the eigenvalues of the matrix A. It also changes drastically as the eigenval- 
ues of A change sign or become imaginary. 

When analyzing Equation (1), it is often helpful to visualize a vector 


-( 


in R2 as a direction, or directed line segment, in the plane. Let 


(5) 


be a vector in R? and draw the directed line segment xX from the point (0,0) 
to the point (x,,x,), as in Figure la. This directed line segment is parallel 
to the line through (0,0) with direction numbers x,,x, respectively. If we 
visualize the vector x as being this directed line segment x, then we see that 
the vectors x and cx are parallel if c is positive, and antiparallel if c is 
negative. We can also give a nice geometric interpretation of vector addi- 
tion. Let x and y be two vectors in R*. Draw the directed line segment x, 
and place the vector y at the tip of x. The vector x+y is then the composi- 


Xo 


(-2,-1) 


(a) (b) 
Figure | 


(X,+Y, Xt Yo) 


Figure 2 
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tion of these two directed line segments (see Figure 2). This construction is 
known as the parallelogram law of vector addition. 

We are now in a position to derive the phase portraits of (1). Let A, and 
A, denote the two eigenvalues of A. We distinguish the following cases. 

1. A,<A,<0. Let v! and v’ be eigenvectors of A with eigenvalues A, and 
A, respectively. In the x, — x, plane we draw the four half-lines /,, /;, /,, and 
1}, as shown in Figure 3. The rays /, and J, are parallel to v' and y’, while 
the rays /{ and Jj are parallel to —v' and —v*. Observe first that x()= 
ce*'y! is a solution of (1) for any constant c. This solution is always pro- 
portional to v', and the constant of proportionality, ce*"’, runs from + 00 to 
0, depending as to whether c is positive or negative. Hence, the orbit of this 
solution is the half-line /, for c>0, and the half-line /; for c<0. Similarly, 
the orbit of the solution x(‘)=ce’v’ is the half-line /, for c>0, and the 
half-line /; for c<0. The arrows on these four lines in Figure 3 indicate in 
what direction x(t) moves along its orbit. 

Next, recall that every solution x(t) of (1) can be written in the form 


x(t)=c,e*"v' + c,e*v? (2) 


for some choice of constants c, and c,. Obviously, every solution x(f) of (1) 
approaches (3) as ¢ approaches infinity. Hence, every orbit of (1) ap- 
proaches the origin x,=x,=0 as ¢ approaches infinity. We can make an 
even stronger statement by observing that ey? is very small compared to 
e\'y! when ¢ is very large. Therefore, x(1), for c, #0, comes closer and 
closer to c,e*“v! as ¢ approaches infinity. This implies that the tangent to 
the orbit of x(t) approaches /, if c, is positive, and /; if c, is negative. Thus, 


Xo 


Figure 3. Phase portrait of a stable node 
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the phase portrait of (1) has the form described in Figure 3. The dis- 
tinguishing feature of this phase portrait is that every orbit, with the excep- 
tion of a single line, approaches the origin in a fixed direction (if we con- 
sider the directions v' and —v! equivalent). In this case we say that the 
equilibrium solution x=0 of (1) is a stable node. 


Remark. The orbit of every solution x(¢) of (1) approaches the origin x, = 
xX, =0 as ¢ approaches infinity. However, this point does not belong to the 
orbit of any nontrivial solution x(f). 


1’. O<A, <A). The phase portrait of (1) in this case is exactly the same 
as Figure 3, except that the direction of the arrows is reversed. Hence, the 
equilibrium solution x(¢)=0 of (1) is an unstable node if both eigenvalues 
of A are positive. 


2. A, =A, <0. In this case, the phase portrait of (1) depends on whether 
A has one or two linearly independent eigenvectors. (a) Suppose that A has 
two linearly independent eigenvectors v' and vy? with eigenvalue (<0. In 
this case, every solution x(t) of (1) can be written in the form 


x(t) =c,e“v' + cse“v? = e™ (cv! + cv’) (2) 


for some choice of constants c, and c,. Now, the vector e*’(c,v! + c,v’) is 
parallel to c,v'+c,v? for all ¢. Hence, the orbit of every solution x(t) of (1) 
is a half-line. Moreover, the set of vectors {c,v' + c,v’}, for all choices of c, 
and c,, cover every direction in the x,—.x, plane, since v' and v’ are lin- 
early independent. Hence, the phase portrait of (1) has the form described 
in Figure 4a. (b) Suppose that A has only one linearly independent eigen- 
vector v, with eigenvalue X. In this case, x'(t)= ev is one solution of (1). 
To find a second solution of (1) which is independent of x', we observe 
that (A—AI)*u=0 for every vector u. Hence, 


x(t) = e“tu=eNeA— Dy = eT ut t(A—ADu] (3) 


is a solution of (1) for any choice of u. Equation (3) can be simplified by 
observing that (A—AI)u must be a multiple k of v. This follows im- 
mediately from the equation (A—AI[(A—ADu]=90, and the fact that A has 
only one linearly independent eigenvector vy. Choosing wu independent of v, 
we see that every solution x(f) of (1) can be written in the form 


x(t) =c,e“v+ cye™ (ut ktv) = e™ (c,\v+ cout c)ktv), (4) 


for some choice of constants c, and c,. Obviously, every solution x(t) of (1) 
approaches (9) as ¢ approaches infinity. In addition, observe that cv+c.u 
is very small compared to c,ktv if c, is unequal to zero and t¢ is very large. 
Hence, the tangent to the orbit of x(t) approaches +v (depending on the 
sign of c)) as ¢ approaches infinity, and the phase portrait of (1) has the 
form described in Figure 4b. 
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(a) (b) 
Figure 4 


2’. A; =A, >0. The phase portraits of (1) in the cases (2a)’ and (2b)’ are 
exactly the same as Figures 4a and 4b, except that the direction of the 
arrows is reversed. 

3. A, <O<A,. Let v! and v’ be eigenvectors of A with eigenvalues A, and 
A, respectively. In the x, — x, plane we draw the four half-lines /,, /;, /,, and 
1}; the half-lines /, and /, are parallel to v! and v’, while the half-lines /; 
and J; are parallel to —v! and — vy’. Observe first that every solution x(t) of 
(1) is of the form 


x(t) =c,e*"'y! + c,e*"v? (5) 


for some choice of constants c, and c,. The orbit of the solution x(¢)= 
cje\"v! is 1, for c, >O0 and J; for c,; <0, while the orbit of the solution 
x(t)=c,e*y? is 1, for c, >0 and J; for c, <0. Note, too, the direction of the 
arrows on J, Jj, 1,, and J; the solution x(¢)=c,e*"v' approaches (0) as t 
approaches infinity, whereas the solution x(t)=c,e*v? becomes un- 
bounded (for c,#0) as t approaches infinity. Next, observe that e*"v! is 
very small compared to e*‘y? when ¢ is very large. Hence, every solution 
x(t) of (1) with c,4#0 becomes unbounded as ¢ approaches infinity, and its 
orbit approaches either /, or /;. Finally, observe that ev? is very small 
compared to e*"v' when ¢ is very large negative. Hence, the orbit of any 
solution x(t) of (1), with c, 0, approaches either /, or /; as t approaches 
minus infinity. Consequently, the phase portrait of (1) has the form de- 
scribed in Figure 5. This phase portrait resembles a “saddle” near x, = x, = 
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1 Lo 


Figure 5. Phase portrait of a saddle point 


0. For this reason, we say that the equilibrium solution x(t)=0 of (1) is a 
saddle point if the eigenvalues of A have opposite sign. 

4. A, =a+iB,A,=a—iB,B4O0. Our first step in deriving the phase 
portrait of (1) is to find the general solution of (1). Let z=u+ iv be an ei- 
genvector of A with eigenvalue a+ i8. Then, 

x(t) = e(* 8)" (y+ iv) = e (cos Bt + isin Bt)(ut iv) 
= eT ucos Bt — vsin Bt] + ie“ [ usin Bt + vcos Br | 
is a complex-valued solution of (1). Therefore, 
x'(t) =e ucos Bt —vsin Bt] 
and 
x’(t) =e usin Bt + vcos Br] 


are two real-valued linearly independent solutions of (1), and every solu- 
tion x(t) of (1) is of the form x(t)=c,x'(t) + c)x*(t). This expression can be 
written in the form (see Exercise 15) 


R, cos( Bt— 6, 


R,cos( Bt— 8, ) (9) 


x(t) =e ( 


for some choice of constants R, 20, R, 20, 6,, and 6,. We distinguish the 
following cases. 
(a) a=0: Observe that both 


x,(t)=R,cos( Bt—6,) and x,(t)=R,cos( Bt— 4, ) 


are periodic functions of time with period 27/8. The function x,(¢) varies 
between — R, and + R,, while x,(7) varies between — R, and + R,. Conse- 
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quently, the orbit of any solution x(f) of (1) is a closed curve surrounding 
the origin x, = x,=0, and the phase portrait of (1) has the form described 
in Figure 6a. For this reason, we say that the equilibrium solution x(#)=0 
of (1) is a center when the eigenvalues of A are pure imaginary. 

The direction of the arrows in Figure 6a must be determined from the 
differential equation (1). The simplest way of doing this is to check the sign 
of x, when x,=0. If x, is greater than zero for x,=0 and x,>0, then all 
solutions x(z) of (1) move in the counterclockwise direction; if x, is less 
than zero for x,=0 and x,>0, then all solutions x(t) of (1) move in the 
clockwise direction. 


Xo Xo 


(a) (b) 
X2 


(c) 
Figure 6. (a) a=0; (b) a<0; (c)a>0 
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(b) a<0: In this case, the effect of the factor e” in Equation (6) is to 
change the simple closed curves of Figure 6a into the spirals of Figure 6b. 
This is because the point x(27/B)=e?"*/*x(0) is closer to the origin than 
x(0). Again, the direction of the arrows in Figure 6b must be determined 
directly from the differential equation (1). In this case, we say that the 
equilibrium solution x(¢)=0 of (1) 1s a stable focus. 

(c) a>0: In this case, all orbits of (1) spiral away from the origin as f¢ 
approaches infinity (see Figure 6c), and the equilibrium solution x(t)=0 of 
(1) is called an unstable focus. 

Finally, we mention that the phase portraits of nonlinear systems, in the 
neighborhood of an equilibrium point, are often very similar to the phase 
portraits of linear systems. More precisely, let x=x° be an equilibrium 
solution of the nonlinear equation x=f(x), and set u=x—x°. Then, (see 
Section 4.3) we can write the differential equation x =f(x) in the form 


u=Au-+ g(u) (7) 


where A is a constant matrix and g(u) is very small compared to u. We 
state without proof the following theorem. 


Theorem 4. Suppose that u=0 is either a node, saddle, or focus point of the 
differential equation t= Au. Then, the phase portrait of the differential 
equation x=f(x), in a neighborhood of x=x°, has one of the forms de- 
scribed in Figures 3, 5, and 6 (b and c), depending as to whether u=0 is a 
node, saddle, or focus. 


Example 1. Draw the phase portrait of the linear equation 
i= Ax=(~? ax. (8) 


Solution. It is easily verified that 


v=({) a ¥=() 


are eigenvectors of A with eigenvalues —3 and —6, respectively. There- 
fore, x=0 is a stable node of (8), and the phase portrait of (8) has the form 
described in Figure 7. The half-line /; makes an angle of 45° with the x, 
axis, while the half-line /, makes an angle of @ degrees with the x,-axis, 
where tan@=4. 


Example 2. Draw the phase portrait of the linear equation 
1 -3 
= Ax= 
xwaxe( 5 t)s ° 
Solution. It is easily verified that 
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Xe 


t 
Figure 7. Phase portrait of (8) 


Xo 


X, 


Figure 8. Phase portrait of (9) 


are eigenvectors of A with eigenvalues —2 and 4, respectively. Therefore, 
x =0 is a saddle point of (9), and its phase portrait has the form described 
in Figure 8. The half-line /, makes an angle of 45° with the x,-axis, and the 
half-line /, is at right angles to /). 


Example 3. Draw the phase portrait of the linear equation 


k= Ax=( - _i)x (10) 
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Solution. The eigenvalues of A are —1+i. Hence, x=0 is a stable focus of 
(10) and every nontrivial orbit of (10) spirals into the origin as ¢ ap- 
proaches infinity. To determine the direction of rotation of the spiral, we 
observe that x,= — x, when x,=0. Thus, x, is negative for x, >0 and x,= 
0. Consequently, all nontrivial orbits of (10) spiral into the origin in the 
clockwise direction, as shown in Figure 9. 


Xo 


Figure 9. Phase portrait of (10) 


EXERCISES 


Draw the phase portraits of each of the following systems of differential 
equations. 


1. 


11. 


12. 


ae l .-_(0 —-1 _ 4 -1 
i= ( i 5) 2. x (? “Ep 3. Xx (_5 5) 


- {(-4 —-] _ 1 —4 .-_{3 —-1 
x= , 75) 5. x (_3 4} 6. x (2 73) 


_ 0 2 .-_{1 —1 _ 2 ] 
.x=(_$ _i)x 8. x (5 3) 9. x (_¢ 3) 


10. 


is an ellipse. 


The equation of motion of a spring-mass system with damping (see Section 2.6) 
is mZ + cZ + kz=0O, where m, c, and k are positive numbers. Convert this equa- 
tion to a system of first-order equations for x =z, y=Z, and draw the phase 
portrait of this system. Distinguish the overdamped, critically damped, and un- 
derdamped cases. 


Suppose that a 2X2 matrix A has 2 linearly independent eigenvectors with ei- 
genvalue A. Show that A=AI. 
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13. This problem illustrates Theorem 4. Consider the system 


a =y, e =x+2x?, (*) 
(a) Show that the equilibrium solution x =0, y =0 of the linearized system x = 
y, Y=x 18 a Saddle, and draw the phase portrait of the linearized system. 
(b) Find the orbits of (*), and then draw its phase portrait. 
(c) Show that there are exactly two orbits of (*) (one for x >0 and one for x < 
0) on which x->0, y->0 as t-»oo. Similarly, there are exactly two orbits of 
(*) on which x-0, y-0 as t—— 00. Thus, observe that the phase portraits 
of (*) and the linearized system look the same near the origin. 


14. Verify Equation (6). Hint: The expression acoswt+ bsinwt can always be 
written in the form Rcos(wt— 6) for suitable choices of R and 6. 


4.8 Long time behavior of solutions; 
the Poincaré—Bendixson Theorem 


We consider now the problem of determining the long time behavior of all 
solutions of the differential equation 


x) Fi (X1,-++5%q) 
x=f(x), x=]: |, f(x)= , (1) 


xn Fy (Xp 000s Xp_) 


This problem has been solved completely in the special case that f(x) = Ax. 
As we have seen in Sections 4.2 and 4.7, all solutions x(t) of x=Ax must 
exhibit one of the following four types of behavior: (i) x(#) is constant in 
time; (11) x(¢) 1s a periodic function of time; (iii) x(t) is unbounded as ¢ ap- 
proaches infinity; and (iv) x(t) approaches an equilibrium point as ¢ ap- 
proaches infinity. 

A partial solution to this problem, in the case of nonlinear f(x), was 
given in Section 4.3. In that section we provided sufficient conditions that 
every solution x(t) of (1), whose initial value x(0) is sufficiently close to an 
equilibrium point € must ultimately approach € as ¢ approaches infinity. In 
many applications it is often possible to go much further and prove that 
every physically (biologically) realistic solution approaches a single 
equilibrium point as time evolves. In this context, the following two 
lemmas play an extremely important role. 


Lemma 1. Let g(t) be a monotonic increasing (decreasing) function of time 
for t > to, with g(t) < c(>c) for some constant c. Then, g(t) has a limit as 
t approaches infinity. 


ProorF. Suppose that g(f) is monotonic increasing for t >t, and g(f) is 
bounded from above. Let / be the least upper bound of g; that is, / is the 
smallest number which is not exceeded by the values of g(t), for t > tf). This 
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number must be the limit of g(¢) as t approaches infinity. To prove this, let 
e>0 be given, and observe that there exists a time ¢, > f,) such that /— g(¢,) 
<e. (If no such time ¢, exists, then / is not the least upper bound of g.) 
Since g(t) is monotonic, we see that /— g(t)<e for t>4,. This shows that 


/=lim,_,.. &(¢)- OJ 


Lemma 2. Suppose that a solution x(t) of (1) approaches a vector & as t ap- 
proaches infinity. Then, & is an equilibrium point of (1). 


PRooF. Suppose that x(‘) approaches € as ¢ approaches infinity. Then, 
x;(t) approaches &, where is the jth component of £. This implies that 
|x;(¢,) — x,(t,)| approaches zero as both ¢, and ¢, approach infinity, since 


x(1) — (QM (4) - §) +(§- 4 (H))| 
<[5(41) —§1 +15 (4) -§ 
In particular, let ¢;=¢ and t,=¢t,+h, for some fixed positive number A. 
Then, |x,;(¢+ h)— x,(0)| approaches zero as ¢ approaches infinity. But 
dx;(T) 
x(t+h)—x(t)=h i hf, (x(7),.++)X,(7)) 
where rt is some number between ¢ and t+h. Finally, observe that 


f(x\(7),---,%,(7)) must approach fi(g),...,§,) as ¢ approaches infinity. 
Hence, f(§,,...,€) =0, 7=1,2,...,, and this proves Lemma 1. a 


Example 1. Consider the system of differential equations 


B= ax— bry — ex’, O = — cyt dy fy? (2) 
where a, b, c, d, e, and f are positive constants. This system (see Section 
4.10) describes the population growth of two species x and y, where species 
y is dependent upon species x for its survival. Suppose that c/d>a/e. 
Prove that every solution x(t), y(¢) of (2), with x(0) and y(0)>0, ap- 
proaches the equilibrium solution x =a/e, y=0, as ¢t approaches infinity. 
Solution. Our first step is to show that every solution x(t), y(t) of (2) which 
starts in the first quadrant (x>0, y>0) at t=O must remain in the first 
quadrant for all future time. (If this were not so, then the model (2) could 
not correspond to reality.) To this end, recall from Section 1.5 that 
aX, 


*() exyt+(a—exy)e% y(N=0 
is a solution of (2) for any choice of x). The orbit of this solution is the 
point (0,0) for x» =0; the line O0<x<a/e for 0<x)<a/e; the point 
(a/e,0) for x»=a/e; and the line a/e<x< oo for xy >a/e. Thus, the x- 
axis, for x > 0, is the union of four disjoint orbits of (2). Similarly, (see Ex- 
ercise 14), the positive y-axis is a single orbit of (2). Thus, if a solution 
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a/e c/d 
Figure | 


x(t), y(t) of (2) leaves the first quadrant, its orbit must cross another orbit, 
and this is precluded by the uniqueness of orbits (Property 1, Section 4.6). 

Our next step is to divide the first quadrant into regions where dx /dt 
and dy/dt have fixed signs. This is accomplished by drawing the lines 
l,:a—by—ex=0, and /,: —c+dx—fy=0, in the x-y plane. These lines 
divide the first quadrant into three regions I, II, and II as shown in Figure 
1. (The lines /, and /, do not intersect in the first quadrant if c/d>a/e.) 
Now, observe that ex + by is less than a in region I, while ex + by is greater 
than a in regions II and III. Consequently, dx /dt is positive in region | 
and negative in regions II and III. Similarly, dy /dt is negative in regions I 
and II and positive in region III. 

Next, we prove the following four simple lemmas. 


Lemma 3. Any solution x(t), y(t) of (2) which starts in region I at time t = to 
will remain in this region for all future time t > t, and ultimately approach 
the equilibrium solution x =a/e, y =0. 


PRoor. Suppose that a solution x(¢), y(t) of (2) leaves region I at time t= 
t*, Then, x(¢*)=0, since the only way a solution can leave region I is by 
crossing the line /,. Differentiating both sides of the first equation of (2) 
with respect to ¢ and setting t= * gives 


d®x(s*) b(t") 
de = —bx(t \—- 


This quantity is positive. Hence, x(#) has a minimum at t=1/*. But this is 
impossible, since x(¢) is always increasing whenever x(t), y(f) is in region 
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I. Thus, any solution x(1), y(t) of (2) which starts in region I at time t= 
will remain in region I for all future time ¢ > fo. This implies that x(t) is a 
monotonic increasing function of time, and y(#) is a monotonic decreasing 
function of time for > ft), with x(t)<a/e and y(t) >0. Consequently, by 
Lemma 1, both x(#) and y(#) have limits £,7 respectively, as ¢ approaches 
infinity. Lemma 2 implies that (€,7) is an equilibrium point of (2). Now, it 
is easily verified that the only equilibrium points of (2) in the region x > 0, 
y20Oare x=0, y=0, and x=a/e, y=0. Clearly, cannot equal zero since 
x(t) is increasing in region I. Therefore, £=a/e and n=0. 0 


Lemma 4. Any solution x(t), y(t) of (2) which starts in region III at time t= 
ty must leave this region at some later time. 


PRooF. Suppose that a solution x(f), y(2) of (2) remains in region III for 
all time ¢ > fo. Then, x(t) is a monotonic decreasing function of time, and 
y(t) is a monotonic increasing function of time, for ¢ > tp. Moreover, x(Z) is 
greater than c/d and y(t) is less than (dx(t))—c)/f. Consequently, both 
x(t) and y(t) have limits €,y respectively, as ¢ approaches infinity. Lemma 
2 implies that (£7) is an equilibrium point of (2). But (£7) cannot equal 
(0,0) or (a/e,0) if x(t), y(t) is in region III for t >t). This contradiction 
establishes Lemma 4. O 


Lemma 5. Any solution x(t), y(t) of (2) which starts in region II at time t= 
ty and remains in region II for all future time t> tg must approach the 
equilibrium solution x =a/e, y =0. 


ProoF. Suppose that a solution x(t), y(t) of (2) remains in region II for all 
time ¢ > ty. Then, both x(t) and y(¢) are monotonic decreasing functions of 
time for t > fo, with x(¢) >0 and y(t) >0. Consequently, by Lemma 1, both 
x(t) and y(f) have limits £,7 respectively, as t approaches infinity. Lemma 
2 implies that (£7) is an equilibrium point of (2). Now, (§,7) cannot equal 
(0,0). Therefore, = a/e,n=0. 


Lemma 6. A solution x(t),y(t) of (2) cannot enter region III from region II. 


ProoF. Suppose that a solution x(¢), y(t) of (2) leaves region II at time ¢= 
t* and enters region III. Then, y(t*)=0. Differentiating both sides of the 
second equation of (2) with respect to ¢ and setting t=/* gives 


d*y(t*) . dx (t*) 
ro aa) 


This quantity is negative. Hence, y(¢) has a maximum at t=/*. But this is 
impossible, since y(t) is decreasing whenever x(f), y(t) is inregion II. 


Finally, observe that a solution x(Z), y(2) of (2) which starts on /, must 
immediately enter region I, and that a solution which starts on /, must 1m- 
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mediately enter region II. It now follows immediately from Lemmas 3-6 
that every solution x(t), y(t) of (2), with x(0)>0 and y(0)>0, approaches 
the equilibrium solution x =a/e, y=0 as ¢ approaches infinity. 

Up to now, the solutions and orbits of the nonlinear equations that we 
have studied behaved very much like the solutions and orbits of linear 
equations. In actual fact, though, the situation is very different. The solu- 
tions and orbits of nonlinear equations, in general, exhibit a completely 
different behavior than the solutions and orbits of linear equations. A 
standard example is the system of equations 
= +x(1—x?-y?’), @ ax+y(I-x?-y?) (3) 
Since the term x?+,y? appears prominently in both equations, it suggests 
itself to introduce polar coordinates r,@, where x =rcos8, y=rsin@, and to 
rewrite (3) in terms of r and @. To this end, we compute 

d._.4 ar dx dy 


a =2r7 = 2x7 + 2y7- 


=2(x?+y?)—2(x2+y?) =272(1— 72). 


Similarly, we compute 


yd 
dd tan Zab a at xt _, 
at dt XxX x2 1+(y/xy x*+y? ° 


Consequently, the system of equations (3) is equivalent to the system of 
equations 

ar ig? a 

7 r(l-r’), l. (4) 
The general solution of (4) is easily seen to be 

‘9 
r(t)= ———__—_——__, 6=t+ 0, (5) 
| rot (1- rye] 


where ro>=r(0) and 8)= 8 (0). Hence, 


lo 
x(t) = ——————— cos(t + 9p), 
[8+ (1-r3)e-2]'7 
'9 ; 
y(t) = ———— sin(t + 9). 


[ro+(1 — yey” 


Now, observe first that x =0, y=0 is the only equilibrium solution of (3). 
Second, observe that 


x(t)=cos(t+6)),  y(t)=sin(t+ 4) 
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Figure 2. The phase portrait of (3) 


when 7)>=1. This solution is periodic with period 27, and its orbit is the 
unit circle x*+y?= 1. Finally, observe from (5) that r(¢) approaches one as 
t approaches infinity, for r>0. Hence, all the orbits of (3), with the excep- 
tion of the equilibrium point x=0, y=0, spiral into the unit circle. This 
situation is depicted in Figure 2. 

The system of equations (3) shows that the orbits of a nonlinear system 
of equations may spiral into a simple closed curve. This, of course, is not 
possible for linear systems. Moreover, it is often possible to prove that 
orbits of a nonlinear system spiral into a closed curve even when we 
cannot explicitly solve the system of equations, or even find its orbits. This 
is the content of the following celebrated theorem. 


Theorem 5. (Poincaré—Bendixson.) Suppose that a solution x= x(t), y= 
y(t) of the system of differential equations 


4 f(xy), Fm g(xy) (6) 


remains in a bounded region of the plane which contains no equilibrium 
points of (6). Then, its orbit must spiral into a simple closed curve, which is 
itself the orbit of a periodic solution of (6). 


Example 2. Prove that the second-order differential equation 
Z+(z7+2z?—1)z+z=0 (7) 
has a nontrivial periodic solution. 
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Solution. First, we convert Equation (7) to a system of two first-order 
equations by setting x =z and y=Z. Then, 
dx _ y__ _ ¥2_9,,2 
7” a x+(1—x?-2y?)y. (8) 
Next, we try and find a bounded region R in the x — y plane, containing no 
equilibrium points of (8), and having the property that every solution x(t), 
y(t) of (8) which starts in R at time ¢= 1/5, remains there for all future time 
t > to. It can be shown that a simply connected region such as a square or 
disc will never work. Therefore, we try and take R to be an annulus 
surrounding the origin. To this end, compute 
d[{[x°*t+y?\__ ax WY 2 4.2\.2 
( 5 JaxB + =(I x?—2y?) y?, 


dt 


and observe that 1— x*—2y? is positive for x?+y*<4 and negative for 
x*+y?>1. Hence, x?7(¢)+.y?(#) is increasing along any solution x(¢), y(t) 
of (8) when x?+y*<4 and decreasing when x?+y?> 1. This implies that 
any solution x(t), y(t) of (8) which starts in the annulus }<x*+y?<1 at 
time ¢=f, will remain in this annulus for all future time ¢> ft). Now, this 
annulus contains no equilibrium points of (8). Consequently, by the 
Poincaré-Bendixson Theorem, there exists at least one periodic solution 
x(t), y(t) of (8) lying entirely in this annulus, and then z = x(f) 1s a nontn- 
vial periodic solution of (7). 


EXERCISES 


1. What Really Happened at the Paris Peace Talks 

The original plan developed by Henry Kissinger and Le Duc Tho to settle the 
Vietnamese war is described below. It was agreed that 1 million South Viet- 
namese ants and | million North Vietnamese ants would be placed in the back- 
yard of the Presidential palace in Paris and be allowed to fight it out for a long 
period of time. If the South Vietnamese ants destroyed nearly all the North 
Vietnamese ants, then South Vietnam would retain control of all of its land. If 
the North Vietnamese ants were victorious, then North Vietnam would take 
over all of South Vietnam. If they appeared to be fighting to a standoff, then 
South Vietnam would be partitioned according to the proportion of ants re- 
maining. Now, the South Vietnamese ants, denoted by S, and the North Viet- 
namese ants, denoted by N, compete against each other according to the 
following differential equations: 


dS_ 1, 1 
10> 5 XN ( 
aN 1 yy oy Nn. 


‘dt 100 100°. 100 


Note that these equations correspond to reality since the South Vietnamese 
ants multiply much more rapidly than the North Vietnamese ants, but the 
North Vietnamese ants are much better fighters. 
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The battle began at 10:00 sharp on the morning of May 19, 1972, and was 
supervised by a representative of Poland and a representative of Canada. At 
2:43 p.m. on the afternoon of May 21, the representative of Poland, being un- 
happy with the progress of the battle, slipped a bag of North Vietnamese ants 
into the backyard, but he was spotted by the eagle eyes of the representative of 
Canada. The South Vietnamese immediately claimed a foul and called off the 
agreement, thus setting the stage for the protracted talks that followed in Paris. 
The representative of Poland was hauled before a judge in Paris for sentencing. 
The judge, after making some remarks about the stupidity of the South Viet- 
namese, gave the Polish representative a very light sentence. Justify mathemati- 
cally the judge’s decision. Hint: 

(a) Show that the lines N=2 and N+ S=1 divide the first quadrant into three 
regions (see Figure 3) in which dS /dt and dN /dt have fixed signs. 

(b) Show that every solution S(t), N (¢) of (*) which starts in either region I or 
region III must eventually enter region II. 

(c) Show that every solution S(t), N (¢) of (*) which starts in region II must re- 
main there for all future time. 

(d) Conclude from (c) that S(#)—>0o for all solutions S(¢),N(¢) of (*) with 
S (to) and N (fo) positive. Conclude too that N(#) has a finite limit (< 2) as 
[> 00. 

(e) To prove that N(t)->0, observe that there exists f9 such that dN/dt< —N 
for t > fo. Conclude from this inequality that N(#)->0 as to. 


Figure 3 


2. Consider the system of differential equations 


dx dy 
ON, Gy dye” (*) 


with a/b>c/e. Prove that y(t)0 as too, for every solution x(¢), y(t) of (*) 
with x(t.) and y(t) positive. Hint: Follow the outline in Exercise 1. 


3. (a) Without computing the eigenvalues of the matrix 
(T -3) 
1 -3/ 
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Figure 4 


prove that every solution x(f) of 


.__({ —3 | 

x ( 1 -3 )x 
approaches zero as ¢ approaches infinity. Hint: (a) Show that the lines x. = 
3x, and x, =3.x, divide the x, — x, plane into four regions (see Figure 4) in 
which x, and x, have fixed signs. 

(b) Show that every solution x(t) which starts in either region I or II must re- 
main there for all future time and ultimately approach the equilibrium solu- 
tion x=0. 

(c) Show that every solution x(t) which remains exclusively in region III or IV 
must ultimately approach the equilibrium solution x=0. 


A closed curve C is said to be a limit cycle of 
X=f(x,y), y=ge(x,y) (*) 


if orbits of (*) spiral into it, or away from it. It is stable if all orbits of (*) 
passing sufficiently close to it must ultimately spiral into it, and unstable 
otherwise. Find all limit cycles of each of the following systems of differen- 
tial equations. (Hint: Compute d(x*+y*)/dt. Observe too, that C must be 
the orbit of a periodic solution of (*) if it contains no equilibrium points of 


(*).) 


x(x?+y?—2) 
4, x= -y- ——_—_—_—_ 5. x=x—x3— xy? 
VV x+y? 
3 yoy—y—yx? 
y(x*+y*—2) 
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6. X=y+ x(x? +y?—1)(x?+y?—2) 7. X= xy + xcos(x?+y7) 
y=-—xty(x?+y?—-1)(x?+y?-2) y=—x?+ycos(x?+y’) 
8. (a) Show that the system 
kaytaf(n/r,  yarxtyf()/r (PP =x? +y’) (*) 


has limit cycles corresponding to the zeros of f(r). What is the direction of 
motion on these curves? 

(b) Determine all limit cycles of (*) and discuss their stability if f(r)= 
(r—3)°(r?—5r+4). 


Use the Poincaré-Bendixson Theorem to prove the existence of a nontri- 
vial periodic solution of each of the following differential equations. 
9, 7+(27+24-—2)z=0 10. Z+[In(z?+427)]z+z=0 


11. (a) According to Green’s theorem in the plane, if C is a closed curve which 1s 
sufficiently “smooth,” and if f and g are continuous and have continuous 
first partial derivatives, then 


$f (ay)d—a(xy)dx]= ff [fe Gay) +8, (x,y) ]dxdy 
R 


where R is the region enclosed by C. Assume that x(t), y(¢) is a periodic 
solution of x= f(x,y), y=g(x,y), and let C be the orbit of this solution. 
Show that for this curve, the line integral above is zero. 

(b) Suppose that f.+g, has the same sign throughout a simply connected re- 
gion D in the x—y plane. Show that the system of equations x= f(x,y), 
y =g(x,y) can have no periodic solution which is entirely in D. 


12. Show that the system of differential equations 
X=xty74+x3, ype-xtytyx? 
has no nontrivial periodic solution. 
13. Show that the system of differential equations 
X=x—xy?+y3, yp=3y —yx2+ x? 
has no nontrivial periodic solution which lies inside the circle x7 + y?=4. 


14, (a) Show that x =0, y =y(?) is a solution of (2) for any function y(f) satisfying 
y=—cy—fy’. 
(b) Choose (tp) > 0. Show that the orbit of x =0, y= Y(¢) (for all ¢ for which y 
exists) is the positive y axis. 


4.9 Introduction to bifurcation theory 


Consider the system of equations 


x = f(x, e) (1) 


where 
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and ¢ is a scalar. Intuitively speaking a bifurcation point of (1) is a value of € 
at which the solutions of (1) change their behavior. More precisely, we say 
that e= e, is a bifurcation point of (1) if the phase portraits of (1) for e< e, 
and e>e, are different. 


Remark. In the examples that follow we will appeal to our intuition in 
deciding whether two phase portraits are the same or are different. In more 
advanced courses we define two phase portraits to be the same, or topologi- 
cally equivalent, if there exists a continuous transformation of the plane 
onto itself which maps one phase portrait onto the other. 


Example 1. Find the bifurcation points of the system 


= Ax=(1 _t)x (2) 


Solution. The characteristic polynomial of the matrix A is 
p(A)=det(A— AI) 


= det 17 . 


] —-1-A 
=(A-1)(At+1)—e 
=’ —(1+e). 


The roots of p(A) are +¥1l+e for e>—1, and +¥—e-—1i for e<—1. This 
Lx, =(1-vVv1—e) x, 


>: 
x, =(1+V1+.e) x, 


Figure 1. Phase portrait of (2) for e>—1 
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Figure 2. Phase portrait of (2) for e= — 1 


implies that x =0 is a saddle for e>—1, and a center for e<—1l. We 
conclude, therefore, that e= — 1 is a bifurcation point of (2). It is also clear 
that Eq. (2) has no other bifurcation points. 

It is instructive to see how the solutions of (2) change as € passes through 
the bifurcation value — 1. For e>—1, the eigenvalues of A are 


A, =ylte, A,=—ylte. 


It is easily verified (see Exercise 10) that 


I+ylt+e 
] 
is an eigenvector of A with eigenvalue ¥1+ e€, while 


e= [rie 


is an eigenvector with eigenvalue —y1+. Hence, the phase portrait of (2) 
has the form shown in Figure 1. As e— —1 from the left, the lines /, and /, 
both approach the line x, = x,. This line is a line of equilibrium points of 
(2) when e= —1, while each line x, — x, =c (c #0) is an orbit of (2) for 
e= — |. The phase portrait of (2) for e= — 1 is given in Figure 2. 


1 — 


Example 2. Find the bifurcation points of the system 
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Solution. The characteristic polynomial of the matrix A is 


p(A) = det(A— AI) = det ~A “a .| 


=A(1+A)+e 
=MV+At+e 


and the roots of p(A) are 


—l1+y1—4e \ —l—yl—4e 


A= 2 an 2 


Observe that A, is positive and A, is negative for e<0. Hence, x =0 is a 
saddle for e< 0. For 0<e<1/4, both A, and A, are negative. Hence x = 0 
is a Stable node for 0<e<1/4. Both A, and A, are complex, with negative 
real part, for e>1/4. Hence x = 0 is a stable focus for e>1/4. Note that the 
phase portrait of (3) changes as e passes through 0 and 1/4. We conclude, 
therefore, that e= 0 and e=1/4 are bifurcation points of (3). 


Example 3. Find the bifurcation points of the system of equations 


dx, 

—-{x 

dt 2 

Ie (4) 


Solution. (i) We first find the equilibrium points of (4). Setting dx, /dt =0 
gives x, = 0, and then setting dx, /dt = 0 gives x? — e=0, so that x, ==+ye, 
e>0. Hence, (ve ,0) and (—ve,0) are two equilibrium points of (4) for 
e>0. The system (4) has no equilibrium points when e<0. We conclude, 
therefore, that e=0 is a bifurcation point of (4). (ii) We now analyze the 
behavior of the solutions of (4) near the equilibrium points (+ye ,0) to 
determine whether this system has any additional bifurcation points. Setting 


u=x,*\e, v=X, 


gives 
du _ 
dt 
a (5) 
= = (uve) —p—e=+2feu—v+u?. 
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The system (5) can be written in the form 


d 0 ] 


G(8)-(22@ -i}(4+(2) 


By Theorem 4, the phase portrait of (4) near the equilibrium solution 


is determined by the phase portrait of the linearized system 


0 ] 


f(u)=a(4)=( 3 -](8) 
To find the eigenvalues of A we compute 


p(A) =det(A— AI) 


—X l 
+2ye —1-A 


=N+AF2/e. 


= det 


Hence, the eigenvalues of A when u=x,—Ve are 


—1+/1+8ye —]—14 8/e 


ee (6) 


while the eigenvalues of A when u=x,+Ve are 


—1+/1-8/e —1—1—8ye 
=, = (7) 
Observe from (6) that A, > 0, while A, <0. Thus, the system (4) behaves like 


ve 


a saddle near the equilibrium points . On the other hand, we see from 


(7) that both A, and A, are negative for 0<e<1/64, and complex for 
e>1/64. Consequently, the system (4) near the equilibrium solution 

ve behaves like a stable node for 0<e<1/64, and a stable focus for 
e>1/64. It can be shown that the phase portraits of a stable node and a 
stable focus are equivalent. Consequently, e= 1/64 is not a bifurcation point 
of (4). 
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Another situation which is included in the context of bifurcation theory, 
and which is of much current research interest now, is when the system (1) 
has a certain number of equilibrium, or periodic, solutions for e= €), and a 
different number for e#eé,). Suppose, for example, that x(¢) is an equi- 
librium, or periodic, solution of (1) for e=0, and x'(r),x?(t),...,x*(t) are 
equilibria, or periodic solutions of (1) for e+0 which approach x(t) as 
e— 0. In this case we say that the solutions x'(t),x?(t),...,x*(t) bifurcate 
from x(t). We illustrate this situation with the following example. 


Example 4. Find all equilibrium solutions of the system of equations 


— = 3ex, —3ex, — x? — x3 
b (8) 
Hoe 1X2 =x,(e—x,). 


x 
Solution. Let | x, | be an equilibrium solution of the system (8). The second 
equation of (8) implies that x, =0 or x, =e. 


x, =0. In this case, the first equation of (8) implies that 
0 = 3ex, +x? =x,(3e+x,) 


so that x, =0 or x, =—3e. Thus (° and ( * are two equilibrium 
points of (8). 


x,=e. In this case, the first equation of (8) implies that 
3ex, —3e* —x?— 2? =0 
or 
x? —3ex,+4e? =0. (9) 


The solutions 
3€ +7 9e* — 16e 
x, EF 
2 


of (9) are complex. Thus, 


are two equilibrium points of (8), for e+ 0, which bifurcate from the single 


equilibrium point x = 0 when e= 0. 
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EXERCISES 


Find the bifurcation points of each of the following systems of equations. 


1 x=(! | 2x=(! 7 

e | e | 
_{ 0 2 , [0 | 

3 x=(° >) 4. x (5 E * 
. {0 —e 

5.x=( 0 Jx 


In each of Problems 6-8, show that more than one equilibrium solutions 
bifurcate from the equilibrium solution x = 0 when e= 0. 


6. X, = ex, — ex, — x7 + x5 7. X= ex, — x? — x 1x, 

Xy = EXT X)X> X= —2ex, +2ex,+x,x,— x? 
8. xX, = ex, +Xx)x 

Xo =— ex, tex, tx? +x} 


9. Consider the system of equations 
X= 3ex,—Sex, —x?+ x3 
Xy = 2EX, — EX. (*) 
(a) Show that each point on the lines x, =x, and x, =— x, are equilibrium 
points of (*) for e= 0. 


(b) Show that 
S)-(0) = oy 
(x (5 and x2) 3*\ 2] 
are the only equilibrium points of (*) for e+ 0. 


10. Show that 


1+,/l+.e¢ 
l 


= 


and a= [tvi 


] 


are eigenvectors of the matrix ( | | ] with eigenvalues yl+e and —vyl+e 


respectively. 


4.10 Predator—prey problems; or why the 
percentage of sharks caught in the 
Mediterranean Sea rose dramatically 
during World War I 


In the mid 1920’s the Italian biologist Umberto D’Ancona was studying 
the population variations of various species of fish that interact with each 
other. In the course of his research, he came across some data on per- 
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centages-of-total-catch of several species of fish that were brought into dif- 
ferent Mediterranean ports in the years that spanned World War I. In par- 
ticular, the data gave the percentage-of-total-catch of selachians, (sharks, 
skates, rays, etc.) which are not very desirable as food fish. The data for 
the port of Fiume, Italy, during the years 1914-1923 is given below. 


1914 1915 1916 1917 1918 
11.9% 21.4% 22.1% 21.2% 36.4% 


1919 1920 1921 1922 1923 
27.3% 16.0% 15.9% 14.8% 10.7% 


D’Ancona was puzzled by the very large increase in the percentage of 
selachians during the period of the war. Obviously, he reasoned, the in- 
crease in the percentage of selachians was due to the greatly reduced level 
of fishing during this period. But how does the intensity of fishing affect 
the fish populations? The answer to this question was of great concern to 
D’Ancona in his research on the struggle for existence between competing 
species. It was also of concern to the fishing industry, since it would have 
obvious implications for the way fishing should be done. 

Now, what distinguishes the selachians from the food fish is that the 
selachians are predators, while the food fish are their prey; the selachians 
depend on the food fish for their survival. At first, D’Ancona thought that 
this accounted for the large increase of selachians during the war. Since the 
level of fishing was greatly reduced during this period, there were more 
prey available to the selachians, who therefore thrived and multiplied 
rapidly. However, this explanation does not hold any water since there 
were also more food fish during this period. D’Ancona’s theory only shows 
that there are more selachians when the level of fishing is reduced; it does 
not explain why a reduced level of fishing is more beneficial to the preda- 
tors than to their prey. 

After exhausting all possible biological explanations of this phenome- 
non, D’Ancona turned to his colleague, the famous Italian mathematician 
Vito Volterra. Hopefully, Volterra would formulate a mathematical model 
of the growth of the selachians and their prey, the food fish, and this 
model would provide the answer to D’Ancona’s question. Volterra began 
his analysis of this problem by separating all the fish into the prey popula- 
tion x(¢) and the predator population y(t). Then, he reasoned that the food 
fish do not compete very intensively among themselves for their food 
supply since this is very abundant, and the fish population is not very 
dense. Hence, in the absence of the selachians, the food fish would grow 
according to the Malthusian law of population growth x=ax, for some 
positive constant a. Next, reasoned Volterra, the number of contacts per 
unit time between predators and prey is bxy, for some positive constant b. 
Hence, x = ax — bxy. Similarly, Volterra concluded that the predators have 
a natural rate of decrease —cy proportional to their present number, and 
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that they also increase at a rate dxy proportional to their present number y 
and their food supply x. Thus, 


dx dy __ 
yo bxy, a cy + dxy. (1) 


The system of equations (1) governs the interaction of the selachians 
and food fish in the absence of fishing. We will carefully analyze this sys- 
tem and derive several interesting properties of its solutions. Then, we will 
include the effect of fishing in our model, and show why a reduced level of 
fishing is more beneficial to the selachians than to the food fish. In fact, we 
will derive the surprising result that a reduced level of fishing is actually 
harmful to the food fish. 

Observe first that (1) has two equilibrium solutions x(t)=0, y(t) =0 and 
x(t)=c/d, y(t)=a/b. The first equilibrium solution, of course, is of no in- 
terest to us. This system also has the family of solutions x(t)= x e”, y(O= 
0 and x(t)=0, y(t)=y,e “. Thus, both the x and y axes are orbits of (1). 
This implies that every solution x(t), y(t) of (1) which starts in the first 
quadrant x >0, y>0 at time ¢=¢, will remain there for all future time ¢ > 
to. 

The orbits of (1), for x,y#0 are the solution curves of the first-order 
equation 


et aecbe lb (2) 


This equation is separable, since we can write it in the form 


a— by D _ —ctdx 
y ax x 


Consequently, alny — by +clnx—dx=k,, for some constant k,. Taking 
exponentials of both sides of this equation gives 


a aok 3) 


e® et 


for some constant K. Thus, the orbits of (1) are the family of curves de- 
fined by (3), and these curves are closed as we now show. 


Lemma 1. Equation (3) defines a family of closed curves for x, y >9. 


PRrooF. Our first step is to determine the behavior of the functions f(y)= 
yt*/e” and g(x)=x‘°/e™ for x and y positive. To this end, observe that 
f(0)=0, f(co)=0, and f(y) is positive for y >0. Computing 


a—l a a—l — 
ay*” * — by y*~*(a- by) 
(= = —, 


we see that f(y) has a single critical point at y=a/b. Consequently, f(y) 
achieves its maximum value M,=(a/b)*/e* at y=a/b, and the graph of 


9 


ey» ey 
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fly) g(x) 


a/b c/d 
(a) (b) 
Figure 1. (a) Graph of f(y)=y%e~”; (b) Graph of g(x)=x‘°e~® 


f(y) has the form described in Figure la. Similarly, g(x) achieves its maxi- 
mum value M,=(c/d)°/e° at x=c/d, and the graph of g(x) has the form 
described in Figure 1b. 

From the preceding analysis, we conclude that Equation (3) has no solu- 
tion x,y >0 for K > M,M,, and the single solution x =c/d, y=a/b for K 
= M,,.M,. Thus, we need only consider the case K=AM,, where 4 is a posi- 
tive number less than M,. Observe first that the equation x°/e” =A has 
one solution x=x,,<c/d, and one solution x =x, >c/d. Hence, the 
equation 


f(y)=y%e7 = ( Sa |, 


has no solution y when x is less than x,, or greater than x,,. It has the sin- 
gle solution y=a/b when x=x,, or x,,, and it has two solutions y,(x) and 
y2(x) for each x between x,, and x,y. The smaller solution y,(x) is always 
less than a/b, while the larger solution y,(x) is always greater than a/b. 
As x approaches either x,, or x,,, both y,(x) and y,(x) approach a/b. Con- 
sequently, the curves defined by (3) are closed for x and y positive, and 
have the form described in Figure 2. Moreover, none of these closed curves 
(with the exception of x =c/d, y=a/b) contain any equilibrium points of 
(1). Therefore, all solutions x(t), y(t) of (1), with x(0) and y(0) positive, 
are periodic functions of time. That is to say, each solution x(t), y(t) of 
(1), with x(0) and y(0) positive, has the property that x(t+ T)= x(t) and 
y(t+T)=y(2) for some positive T. O 


Now, the data of D’Ancona is really an average over each one year 
period of the proportion of predators. Thus, in order to compare this data 
with the predictions of (1), we must compute the “average values” of x(t) 
and y(¢), for any solution x(t), y(t) of (1). Remarkably, we can find these 
average values even though we cannot compute x(f) and y(t) exactly. This 
is the content of Lemma 2. 
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a/b 


Xm c/d X 


Figure 2. Orbits of (1) for x,y positive 


Lemma 2. Let x(t), y(t) be a periodic solution of (1), with period T >0. De- 
fine the average values of x and y as 


--1 7 -_l/* 
x= al x(t)dt, j= al y(t)dt. 


Then, x =c/d and y=a/b. In other words, the average values of x(t) and 
y(t) are the equilibrium values. 


ProorF. Dividing both sides of the first equation of (1) by x gives x/x= 
a— by, so that 


Now, f, "x(t) /x(t)dt=Inx(T)—Inx(0), and this equals zero since x(T)= 
x(0). Consequently, 


1 rT _1 T _ 
a by (t) dt = a adt=a, 


so that y=a/b. Similarly, by dividing both sides of the second equation of 
(1) by Ty(t) and integrating from 0 to T, we obtain that x=c/d. oO 


We are now ready to include the effects of fishing 1n our model. Ob- 
serve that fishing decreases the population of food fish at a rate ex(t), and 
decreases the population of selachians at a rate ey(t). The constant e re- 
flects the intensity of fishing; i.e., the number of boats at sea and the num- 
ber of nets in the water. Thus, the true state of affairs is described by the 
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modified system of differential equations 
a = ax — bxy — ex =(a—)x — bxy 

dy (4) 

a —cy + dxy —-ey= —(ct+e)y+dxy. 

This system is exactly the same as (1) (for a—e>0), with a replaced by 

a—e, and c replaced by c+e. Hence, the average values of x(t) and y(t) 

are now 


—- CTE —~_ aE 


Consequently, a moderate amount of fishing (e <a) actually increases the 
number of food fish, on the average, and decreases the number of 
selachians. Conversely, a reduced level of fishing increases the number of 
selachians, on the average, and decreases the number of food fish. This re- 
markable result, which is known as Volterra’s principle, explains the data 
of D’Ancona, and completely solves our problem. 

Volterra’s principle has spectacular applications to insecticide treat- 
ments, which destroy both insect predators and their insect prey. It implies 
that the application of insecticides will actually increase the population of 
those insects which are kept in control by other predatory insects. A re- 
markable confirmation comes from the cottony cushion scale insect 
(Icerya purchasi), which, when accidentally introduced from Australia in 
1868, threatened to destroy the American citrus industry. Thereupon, its 
natural Australian predator, a ladybird beetle (Novius Cardinalis) was in- 
troduced, and the beetles reduced the scale insects to a low level. When 
DDT was discovered to kill scale insects, it was applied by the orchardists 
in the hope of further reducing the scale insects. However, in agreement 
with Volterra’s principle, the effect was an increase of the scale insect! 

Oddly enough, many ecologists and biologists refused to accept 
Volterra’s model as accurate. They pointed to the fact that the oscillatory 
behavior predicted by Volterra’s model is not observed in most pre- 
dator—prey systems. Rather, most predator-prey systems tend to 
equilibrium states as time evolves. Our answer to these critics is that the 
system of differential equations (1) is not intended as a model of the gen- 
eral predator-prey interaction. This is because the food fish and selachians 
do not compete intensively among themselves for their available resources. 
A more general model of predator-prey interactions is the system of dif- 
ferential equations — 


X=ax—bxy—ex*, y=—cyt+dxy—fy’. (6) 
Here, the term ex? reflects the internal competition of the prey x for their 
limited external resources, and the term fy? reflects the competition among 
the predators for the limited number of prey. The solutions of (6) are not, 
in general, periodic. Indeed, we have already shown in Example 1 of Sec- 


448 


4.10 Predator—prey problems 


tion 4.8 that all solutions x(t), y(Z) of (6), with x(0) and y(Q) positive, 
ultimately approach the equilibrium solution x=a/e, y=0 if c/d is 
greater than a/e. In this situation, the predators die out, since their availa- 
ble food supply is inadequate for their needs. 

Surprisingly, some ecologists and biologists even refuse to accept the 
more general model (6) as accurate. As a counterexample, they cite the ex- 
periments of the mathematical biologist G. F. Gause. In these experiments, 
the population was composed of two species of protozoa, one of which, Di- 
dinium nasatum, feeds on the other, Paramecium caudatum. In all of 
Gause’s experiments, the Didinium quickly destroyed the Paramecium and 
then died of starvation. This situation cannot be modeled by the system of 
equations (6), since no solution of (6) with x(0) y(0)40 can reach x =0 or 
y =0 in finite time. 

Our answer to these critics is that the Didinium are a special, and atypi- 
cal type of predator. On the one hand, they are ferocious attackers and re- 
quire a tremendous amount of food; a Didinium demands a fresh Para- 
mecium every three hours. On the other hand, the Didinium don’t perish 
from an insufficient supply of Paramecium. They continue to multiply, but 
give birth to smaller offspring. Thus, the system of equations (6) does not 
accurately model the interaction of Paramecium and Didinium. A better 
model, in this case, is the system of differential equations 

dx 


dy _|dVxy, x#0 
< =ax—bV — = 7 , 7 
dt ov y, at 7) 


It can be shown (see Exercise 6) that every solution x(t), y(¢) of (7) with 
x(0) and y(0) positive reaches x =0 in finite time. This does not contradict 
the existence—uniqueness theorem, since the function 


eay)= | 0%» x40 
— cy, x=0 


— cy, x=0 


does not have a partial derivative with respect to x or y, at x=0. 

Finally, we mention that there are several predator—prey interactions in 
nature which cannot be modeled by any system of ordinary differential 
equations. These situations occur when the prey are provided with a refuge 
that is inaccessible to the predators. In these situations, it is impossible to 
make any definitive statements about the future number of predators and 
prey, since we cannot predict how many prey will be stupid enough to 
leave their refuge. In other words, this process is now random, rather than 
deterministic, and therefore cannot be modeled by a system of ordinary dif- 
ferential equations. This was verified directly in a famous experiment of 
Gause. He placed five Paramecium and three Didinium in each of thirty 
identical test tubes, and provided the Paramecium with a refuge from the 
Didinium. Two days later, he found the predators dead in four tubes, and 
a mixed population containing from two to thirty-eight Paramecium in the 
remaining twenty-six tubes. 
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Volterra, V: “Lecons sur la théorie mathématique de la lutte pour la vie.” Paris, 


1931. 


EXERCISES 


1. 


2. 


5. 


Find all biologically realistic equilibrium points of (6) and determine their stabil- 
ity. 


We showed in Section 4.8 that y(¢) ultimately approaches zero for all solutions 
x(t), y(t) of (6), if c/d >a/e. Show that there exist solutions x(t), y(¢) of (6) for 
which y(t) increases at first to a maximum value, and then decreases to zero. (To 
an observer who sees only the predators without noticing the prey, such a case 
of a population passing through a maximum to total extinction would be very 
difficult to explain.) 


. In many instances, it is the adult members of the prey who are chiefly attacked 


by the predators, while the young members are better protected, either by their 
smaller size, or by their living in a different station. Let x, be the number of 
adult prey, x, the number of young prey, and y the number of predators. Then, 
x,= — A,X + Ax, — bx, y 
%_= nx, —(a,+a2)x2 
y=-—cytdxyy 
where a,x, represents the number of young (per unit time) growing into adults, 


and n represents the birth rate proportional to the number of adults. Find all 
equilibrium solutions of this system. 


. There are several situations in nature where species 1 preys on species 2 which in 


turn preys on species 3. One case of this kind of population is the Island of 
Komodo in Malaya which is inhabited by giant carnivorous reptiles, and by 
mammals—their food—which feed on the rich vegetation of the island. We 
assume that the reptiles have no direct influence on the vegetation, and that only 
the plants compete among themselves for their available resources. A system of 
differential equations governing this interaction is 


X= — A,X — by 2X X_ + €43X1%3 
X= — 2X2 + by) X\x2 


= a 
X3 = A3X3— AgX3 — C31 1X3 


Find all equilibrium solutions of this system. 


Consider a predator-prey system where the predator has alternate means of sup- 
port. This system can be modelled by the differential equations 

X= 0X1 (By — X41) + Y1%1%2 

X= AxX( By — x2) — 2X 1%2 


where x,(¢) and x,(t) are the predators and prey populations, respectively, at 
time f¢. 
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(a) Show that the change of coordinates B, y;(t) = x,(t/a;B;) reduces this system 
of equations to 


Y= -y)t+ayiyr2 Yo=V2(1-y2)— V1 V2 
where a,= y; 8,/a, 8, and a= y28,/a Bp. 
(b) What are the stable equilibrium populations when (i) 0<a)< 1, (ii) a,>1? 
(c) It is observed that a, =3a, (a, is a measure of the aggressiveness of the pre- 
dator). What is the value of a, if the predator’s instinct is to maximize its 
stable equilibrium population? 


6. (a) Let x(t) be a solution of X=ax— MVx , with M>aV/ x(to) . Show that 
aVx =M- (M- aW x(t) Jerr 10)/2, 


(b) Conclude from (a) that x(¢) approaches zero in finite time. 

(c) Let x(t), y(t) be a solution of (7), with by(to.) >a V x(to) . Show that x(t) 
reaches zero in finite time. Hint: Observe that y(t) is increasing for t > fo. 

(d) It can be shown that by(t) will eventually exceed a \/ x(t) for every solu- 


tion x(t), y(t) of (7) with x(to) and y(to) positive. Conclude, therefore, that 
all solutions x(t), y(t) of (7) achieve x =0 in finite time. 


4.11 The principle of competitive exclusion 
in population biology 


It is often observed, in nature, that the struggle for existence between two 
similar species competing for the same limited food supply and living 
space nearly always ends in the complete extinction of one of the species. 
This phenomenon is known as the “principle of competitive exclusion.” It 
was first enunciated, in a slightly different form, by Darwin in 1859. In his 
paper “The origin of species by natural selection’ he writes: “As the species 
of the same genus usually have, though by no means invariably, much sim- 
ilarity in habits and constitutions and always in structure, the struggle will 
generally be more severe between them, if they come into competition with 
each other, than between the species of distinct genera.” 

There is a very interesting biological explanation of the principle of 
competitive exclusion. The cornerstone of this theory is the idea of a 
“niche.” A niche indicates what place a given species occupies in a com- 
munity; i.e., what are its habits, food and mode of life. It has been ob- 
served that as a result of competition two similar species rarely occupy the 
same niche. Rather, each species takes possession of those kinds of food 
and modes of life in which it has an advantage over its competitor. If the 
two species tend to occupy the same niche then the struggle for existence 
between them will be very intense and result in the extinction of the 
weaker species. 

An excellent illustration of this theory is the colony of terns inhabiting 
the island of Jorilgatch in the Black Sea. This colony consists of four diffe- 
rent species of terns: sandwich-tern, common-tern, blackbeak-tern, and lit- 


451 


4 Qualitative theory of differential equations 


tle-tern. These four species band together to chase away predators from the 
colony. However, there is a sharp difference between them as regards the 
procuring of food. The sandwich-tern flies far out into the open sea to hunt 
certain species, while the blackbeak-tern feeds exclusively on land. On the 
other hand, common-tern and little-tern catch fish close to the shore. They 
sight the fish while flying and dive into the water after them. The little-tern 
seizes his fish in shallow swampy places, whereas the common-tern hunts 
somewhat further from shore. In this manner, these four similar species of 
tern living side by side upon a single small island differ sharply in all their 
modes of feeding and procuring food. Each has a niche in which it has a 
distinct advantage over its competitors. 

In this section we present a rigorous mathematical proof of the law of 
competitive exclusion. This will be accomplished by deriving a system of 
differential equations which govern the interaction between two similar 
species, and then showing that every solution of the system approaches an 
equilibrium state in which one of the species is extinct. 

In constructing a mathematical model of the struggle for existence be- 
tween two competing species, it is instructive to look again at the logistic 
law of population growth 

AN _ yr pny 

i aN — bN*. (1) 
This equation governs the growth of the population N (¢) of a single species 
whose members compete among themselves for a limited amount of food 
and living space. Recall (see Section 1.5) that N(t) approaches the limiting 
population K=a/b, as t approaches infinity. This limiting population can 
be thought of as the maximum population of the species which the micro- 
cosm can support. In terms of K, the logistic law (1) can be rewritten in the 


form 
dN _ _by)\_ _N\_oy(K-N 


Equation (2) has the following interesting interpretation. When the 
population N is very low, it grows according to the Malthusian law dN / dt 
= aN. The term aN is called the “biotic potential” of the species. It is the 
potential rate of increase of the species under ideal conditions, and it is re- 
alized if there are no restrictions on food and living space, and if the indi- 
vidual members of the species do not excrete any toxic waste products. As 
the population increases though, the biotic potential is reduced by the fac- 
tor (K-—N)/K, which is the relative number of still vacant places in the 
microcosm. Ecologists call this factor the environmental resistance to 
growth. 

Now, let N,(¢) and N,(t) be the population at time ¢ of species 1 and 2 
respectively. Further, let K, and K, be the maximum population of species 
1 and 2 which the microcosm can support, and let a,N, and a,N, be the 
biotic potentials of species 1 and 2. Then, N,(t) and N,(t) satisfy the sys- 
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tem of differential equations 


“det! K, dt? ? K, » ©) 


where m, is the total number of places of the first species which are taken 
up by members of the second species, and m, is the total number of places 
of the second species which are taken up by members of the first species. 
At first glance it would appear that m,= N, and m,=N,. However, this is 
not generally the case, for it is highly unlikely that two species utilize the 
environment in identical ways. Equal numbers of individuals’ of species 1 
and 2 do not, on the average, consume equal quantities of food, take up 
equal amounts of living space and excrete equal amounts of waste prod- 
ucts of the same chemical composition. In general, we must set m,=aN, 
and m,= BN,, for some constants a and £. The constants a and £ indicate 
the degree of influence of one species upon the other. If the interests of the 
two species do not clash, and they occupy separate niches, then both a and 
B are zero. If the two species lay claim to the same niche and are very sim- 
ilar, then a and B are very close to one. On the other hand, if one of the 
species, say species 2, utilizes the environment very unproductively; 1.e., it 
consumes a great deal of food or excretes very poisonous waste products, 
then one individual of species 2 takes up the place of many individuals of 
species |. In this case, then, the coefficient a is very large. 

We restrict ourselves now to the case where the two species are nearly 
identical, and lay claim to the same niche. Then, a= B=1, and N,(¢) and 
N,(t) satisfy the system of differential equations 


aH EN | >? Ys = =a,N,{ ———}._ 4) 

dt K, dt K, 
In this. instance, we expect the struggle for existence between species | and 
2 to be very intense, and to result in the extinction of one of the species. 
This is indeed the case as we now show. 


Theorem 6 (Principle of competitive exclusion). Suppose that K, is greater 
than K,. Then, every solution N,(t), N,(t) of (4) approaches the 
equilibrium solution N, = K,, N,=0 as t approaches infinity. In other 
words, if species | and 2 are very nearly identical, and the microcosm can 
support more members of species 1 than of species 2, then species 2 will 
ultimately become extinct. 


Our first step in proving Theorem 6 is to show that N,(¢) and N,(¢) can 
never become negative. To this end, recall from Section 1.5 that 
K,N, (0) 


MOST OFM Oe 


N,(t)=0 
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is a solution of (4) for any choice of N,(0). The orbit of this solution in the 
N,-N, plane is the point (0,0) for V,(0)=0; the line O< N,< K,, N,=0 for 
0< N,(0)< K,; the point (K,,0) for N,(0)=K,; and the line K,<N,< 00, 
N,=0 for N,(0)> K,. Thus, the N, axis, for N, > 0, is the union of four dis- 
tinct orbits. Similarly, the N, axis, for N,>0, is the union of four dis- 
tinct orbits of (4). This implies that all solutions N,(¢), N,(t) of (4) which 
Start in the first quadrant (N, >0,N,>0) of the N,-N, plane must remain 
there for all future time. 

Our second step in proving Theorem 6 is to split the first quadrant into 
regions in which both dN, /dt and dN, /dt have fixed signs. This is accom- 
plished in the following manner. Let /, and /, be the lines K, - N, — N,=0 
and K,—N,—N,=0, respectively. Observe that dN,/dt is negative if 
(N,,N>) lies above /,, and positive if (N,,N,) lies below /,. Similarly, 
dN,/dt is negative if (N,,N,) lies above 1, and positive if (N,,N,) lies be- 
low /,. Thus, the two parallel lines /, and /, split the first quadrant of the 
N,—N, plane into three regions (see Figure 1) in which both dN,/dt and 
dN,/dt have fixed signs. Both N,(t) and N,(t) increase with time (along 
any solution of (4)) in region I; N,(¢) increases, and N,(t) decreases, with 
time in region II; and both N,(¢) and N,(t) decrease with time in region 
Il. 


Figure 1 


Lemma 1. Any solution N,(t), N2(t) of (4) which starts in region | at t= ty 
must leave this region at some later time. 


PRooF. Suppose that a solution N,(¢), N,(t) of (4) remains in region I for 
all time ¢>¢). This implies that both N,(¢) and N,(¢) are monotonic in- 
creasing functions of time for ¢ 2%, with N,(¢t) and N,(¢) less than K,. 
Consequently, by Lemma | of Section 4.8, both N,(t) and N,(t) have limits 
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£,n respectively, as ¢ approaches infinity. Lemma 2 of Section 4.8 implies 
that (,7) is an equilibrium point of (4). Now, the only equilibrium points 
of (4) are (0,0), (K,,0), and (0, K,), and (£7) obviously cannot equal any of 
these three points. We conclude therefore, that any solution N,(t), N,(t) of 
(4) which starts in region I must leave this region at a later time. 


Lemma 2. Any solution N,(t), N,(t) of (4) which starts in region II at time 
t=, will remain in this region for all future time t > to, and ultimately ap- 
proach the equilibrium solution N, = K,, N,=0. 


PRooF. Suppose that a solution N,(t), N,(t) of (4) leaves region II at time 
t= t*. Then, either N,(t*) or N,(t*) is zero, since the only way a solution of 
(4) can leave region II is by crossing /, or /,. Assume that N,(t*)=0. Dif- 
ferentiating both sides of the first equation of (4) with respect to t and set- 
ting t= 7* gives 

d*N,(t*) | —a,N,(t*) dN, (t*) 

de? ~—<S*K'; ,_i‘(‘«;é‘éi 
This quantity is positive. Hence, N,(¢) has a minimum at t=/*. But this is 
impossible, since N,(t) is increasing whenever a solution N,(1), N2(t) of (4) 
is in region II. Similarly, if N,(t*)=0, then 

d?N,(t*)  —a,N,(t*) dN, (t*) 

dt? 7 K, at 

This quantity is negative, implying that N,(¢) has a maximum at ¢=/*. But 
this is impossible, since N,(/) is decreasing whenever a solution N,(2), N2(¢) 
of (4) is in region II. 

The previous argument shows that any solution N,(t), N,(t) of (4) which 
starts in region II at time t=, will remain in region II for all future time 
t > to. This implies that N,(¢) is monotonic increasing and N,(t) is mono- 
tonic decreasing for ¢ > to, with N,(t)< K, and N,(t)> K,. Consequently, 
by Lemma | of Section 4.8, both N,(¢) and N,(t) have limits &,n respec- 
tively, as ¢ approaches infinity. Lemma 2 of Section 4.8 implies that (€,7) is 
an equilibrium point of (4). Now, (& 7) obviously cannot equal (0,0) or 
(0, K,). Consequently, (€,7) =(K,,0), and this proves Lemma 2. CO 


Lemma 3. Any solution N,(t), N,(t) of (4) which starts in region III at time 
t=, and remains there for all future time must approach the equilibrium 
solution N,(t)= K,, N,(t)=0 as t approaches infinity. 


ProoF. If a solution N,(t), N,(2) of (4) remains in region III for ¢ > fo, then 
both N,(4) and N,(t) are monotonic decreasing functions of time for ft > fo, 
with N,(¢)>0 and N,(t)>0. Consequently, by Lemma 1 of Section 4.8, 
both N,(t) and N,(¢) have limits £,n respectively, as ¢ approaches infinity. 
Lemma 2 of Section 4.8 implies that (£7) is an equilibrium point of (4). 
Now, (§,7) obviously cannot equal (0,0) or (0, K,). Consequently, (£7) = 


(K,,0). O 
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PROOF OF THEOREM 6. Lemmas | and 2 above state that every solution 
N,(t), N,(t) of (4) which starts in regions I or II at time t=/, must ap- 
proach the equilibrium solution N,=K,, N,=0 as ¢ approaches infinity. 
Similarly, Lemma 3 shows that every solution N,(t), N2(f) of (4) which 
starts in region III at time t=, and remains there for all future time must 
also approach the equilibrium solution N, = K,, N,=0. Next, observe that 
any solution N,(t), N,(¢) of (4) which starts on /, or /, must immediately 
afterwards enter region II. Finally, if a solution N,(t), N,(t) of (4) leaves re- 
gion III, then it must cross the line /, and immediately afterwards enter re- 
gion II. Lemma 2 then forces this solution to approach the equilibrium 
solution NV, = K,, N,=0. Cc) 


Theorem 6 deals with the case of identical species; i.e, a=B=1. By a 
similar analysis (see Exercises 4—6) we can predict the outcome of the 
struggle for existence for all values of a and B. 


Reference 
Gause, G. F., ‘The Struggle for Existence,’ Dover Publications, New York, 1964. 


EXERCISES 
1. Rewrite the system of equations (4) in the form 
K, dN, K.-N.-N K, dN, 
a,N, dt l pe a,N, dt 


Then, subtract these two equations and integrate to obtain directly that N3(¢) 
approaches zero for all solutions N,(1), N2(t) of (4) with N,(to) >0. 


= K,—N,—-N). 


2. The system of differential equations 


dN, 
= Ni [arte (1- 8, — BN?) 

* 
i (*) 
G7 N2[- a2 21-411 — 522 )] 


is a model of two species competing for the same limited resource. Suppose that 
Cc, >a, and c,>a,. Deduce from Theorem 6 that N,(t) ultimately approaches 
zero if @\Cc2z> a c,, and N,(t) ultimately approaches zero if a,c, <a,¢}. 


3. In 1926, Volterra presented the following model of two species competing for 
the same limited food supply: 


dN, 
a =[b,-A, (AN, + h2N2) Ni 


dN, 
a [b.—Az(hyN, + AN. )]N2. 


Suppose that b, /A, > b2/A2. (The coefficient b;/A; is called the susceptibility of 
species i to food shortages.) Prove that species 2 will ultimately become extinct if 
N (to) >0. 


456 


4.11 The principle of competitive exclusion in population biology 


Problems 4—6 are concerned with the system of equations 


dN, a,N, aN, 2™ 
LT ee TT ee 


4, (a) Assume that K,/a> K, and K,/B< K,. Show that N,(f) approaches zero as 
t approaches infinity for every solution N,(t), N2(t) of (*) with Nj(to) >9. 
(b) Assume that K,/a< K, and K,/£ > K,. Show that N,(¢) approaches zero as 
t approaches infinity for every solution N,(t), N2(t) of (*) with N,N>(to) > 0. 
Hint: Draw the lines /, : N; + aN,= K, and /,: N,+ BN, = K), and follow the 
proof of Theorem 6. 


5. Assume that K,/a> K, and K,/B > Kj. Prove that all solutions N,(¢), N2(¢) of 
(*), with both N,(%)) and N,(to) positive, ultimately approach the equilibrium 
solution 


Hint: 

(a) Draw the lines /,:N,+aN,= Ky and /,: N,+ BN, = K,. The two lines divide 
the first quadrant into four regions (see Figure 2) in which both N, and N, 
have fixed signs. 


Figure 2 


(b) Show that all solutions N,(t), N2(¢) of (*) which start in either region II or 
III must remain in these regions and ultimately approach the equilibrium 
solution N;= N?, N,= N92. 

(c) Show that all solutions N,(7), N2(t) of (*) which remain exclusively in region 
I or region IV for all time ¢ > ¢) must ultimately approach the equilibrium 
solution N,= NP, N,= N°. 
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K /a 


K./B K, 


Figure 3 


6. Assume that K,/a< K, and K,/B< K,. 

(a) Show that the equilibrium solution V,=0, N,=0 of (*) is unstable. 

(b) Show that the equilibrium solutions N, = K,, N,=0 and N,=0, N,= K, of 
(*) are asymptotically stable. 

(c) Show that the equilibrium solution N,=N?, N,=N% (see Exercise 5) of (*) 
is a saddle point. (This calculation is very cumbersome.) 

(d) It is not too difficult to see that the phase portrait of (*) must have the form 
described in Figure 3. 


4.12 The Threshold Theorem of epidemiology 


Consider the situation where a small group of people having an infectious 
disease is inserted into a large population which is capable of catching the 
disease. What happens as time evolves? Will the disease die out rapidly, or 
will an epidemic occur? How many people will ultimately catch the dis- 
ease? To answer these questions we will derive a system of differential 
equations which govern the spread of an infectious disease within a popu- 
lation, and analyze the behavior of its solutions. This approach will also 
lead us to the famous Threshold Theorem of epidemiology which states 
that an epidemic will occur only if the number of people who are suscept- 
ible to the disease exceeds a certain threshold value. 

We begin with the assumptions that the disease under consideration 
confers permanent immunity upon any individual who has completely re- 
covered from it, and that it has a negligibly short incubation period. This 
latter assumption implies that an individual who contracts the disease be- 
comes infective immediately afterwards. In this case we can divide the 
population into three classes of individuals: the infective class (/), the sus- 
ceptible class (S) and the removed class (R). The infective class consists of 
those individuals who are capable of transmitting the disease to others. 
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The susceptible class consists of those individuals who are not infective, 
but who are capable of catching the disease and becoming infective. The 
removed class consists of those individuals who have had the disease and 
are dead, or have recovered and are permanently immune, or are isolated 
until recovery and permanent immunity occur. 

The spread of the disease is presumed to be governed by the following 
rules. 

Rule 1: The population remains at a fixed level N in the time interval 
under consideration. This means, of course, that we neglect births, deaths 
from causes unrelated to the disease under consideration, immigration and 
emigration. 

Rule 2: The rate of change of the susceptible population is proportional 
to the product of the number of members of (S) and the number of mem- 
bers of (J). 

Rule 3: Individuals are removed from the infectious class (J) at a rate 
proportional to the size of (/). 

Let S(¢),/(t), and R(t) denote the number of individuals in classes (S$), 
(J), and (R), respectively, at time ¢. It follows immediately from Rules 1-3 
that S(¢),/(t), R (2) satisfies the system of differential equations 


a =rSI-yI (1) 


for some positive constants r and y. The proportionality constant r is 
called the infection rate, and the proportionality constant y is called the re- 
moval rate. 

The first two equations of (1) do not depend on R. Thus, we need only 
consider the system of equations 


aS _ Ql _ gy 

a rSI, 7 =rSI— yl (2) 
for the two unknown functions S(t) and /(t). Once S(t) and /(f) are 
known, we can solve for R(t) from the third equation of (1). Alternately, 


observe that d(S +/+ R)/dt=0. Thus, 
S(t)+1(t)+ R(t)=constant= N 
so that R(t)= N—S(t)—I (2). 
The orbits of (2) are the solution curves of the first-order equation 


dl _rSI-yI Y 
dS rst '* 75° (3) 
Integrating this differential equation gives 
I(S)=Ip+ Sy-S+pin=, (4) 
0 
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(SI) 


p 
Figure 1. The orbits of (2) 


where S, and J, are the number of susceptibles and infectives at the initial 
time t= fp, and p=y/r. To analyze the behavior of the curves (4), we com- 
pute /’(S)= —1+p/S. The quantity —1+ p/S is negative for S >p, and 
positive for S<p. Hence, J(S) is an increasing function of S for S<p, and 
a decreasing function of S for S >p. 

Next, observe that /(0)=— oo and J(S,))=J)>0. Consequently, there 
exists a unique point S,,, with O< S, < So, such that J(S,,)=0, and J(S) 
>0 for S,,< S< Sp. The point (S,,,0) is an equilibrium point of (2) since 
both dS/dt and dI/dt vanish when J=0. Thus, the orbits of (2), for ty <1 
< oo, have the form described in Figure 1. 

Let us see what all this implies about the spread of the disease within 
the population. As ¢ runs from fg to oo, the point (S(t),/(¢)) travels along 
the curve (4), and it moves along the curve in the direction of decreasing S, 
since S(t) decreases monotonically with time. Consequently, if S, is less 
than p, then J(t) decreases monotonically to zero, and S(t) decreases 
monotonically to S,,. Thus, if a small group of infectives J) is inserted into 
a group of susceptibles Sp, with Sy <p, then the disease will die out rapidly. 
On the other hand, if S, is greater than p, then J(t) increases as S(t) de- 
creases to p, and it achieves a maximum value when S=p. It only starts 
decreasing when the number of susceptibles falls below the threshold value 
o. From these results we may draw the following conclusions. 

Conclusion 1: An epidemic will occur only if the number of susceptibles 
in a population exceeds the threshold value p= y/r. 

Conclusion 2: The spread of the disease does not stop for lack of a sus- 
ceptible population; it stops only for lack of infectives. In particular, some 
individuals will escape the disease altogether. 

Conclusion 1 corresponds to the general observation that epidemics 
tend to build up more rapidly when the density of susceptibles is high due 
to overcrowding, and the removal rate is low because of ignorance, inade- 
quate isolation and inadequate medical care. On the other hand, outbreaks 
tend to be of only limited extent when good social conditions entail lower 
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densities of susceptibles, and when removal rates are high because of good 
public health vigilance and control. 

If the number of susceptibles S, is initially greater than, but close to, the 
threshold value p, then we can estimate the number of individuals who 
ultimately contract the disease. Specifically, if S)— is small compared to 
p, then the number of individuals who ultimately contract the disease is ap- 
proximately 2(S,—p). This is the famous Threshold Theorem of epidemiol- 
ogy, which was first proven in 1927 by the mathematical biologists 
Kermack and McKendrick. 


Theorem 7 (Threshold Theorem of epidemiology). Let So=p+v and 
assume that v/p is very small compared to one. Assume moreover, that the 
number of initial infectives Ig is very small. Then, the number of individu- 
als who ultimately contract the disease is 2v. In other words, the level of 
susceptibles is reduced to a point as far below the threshold as it originally 
was above it. 


ProoF. Letting ¢ approach infinity in (4) gives 


foe) 


So 


If J) is very small compared to So, then we can neglect it, and write 


S 
O= So~ Sin tpln-— 
So (So—S,) 
=S,-S,+p 
0 So 
So- Sy 
= Sy— S..+ pln} 1-{ —<— } |. 
0 


Now, if Sp) —p is small compared to p, then S)— S,, will be small compared 
to Sp. Consequently, we can truncate the Taylor series 


,_(S0=S\]__(So~ Se) _1( So Sx)", 
So 7 So ny 


after two terms. Then, 
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Solving for Sy— S,,, we see that 


So p 
So— S$. =2Sp "7 —1}=2(p+r) 


v “1 
=2p+»)— =2 o(1+ 2) 2s. 0 


During the course of an epidemic it is impossible to accurately ascertain 
the number of new infectives each day or week, since the only infectives 
who can be recognized and removed from circulation are those who seek 
medical aid. Public health statistics thus record only the number of new re- 
movals each day or week, not the number of new infectives. Therefore, in 
order to compare the results predicted by our model with data from actual 
epidemics, we must find the quantity dR/dt as a function of time. This is 
accomplished in the following manner. Observe first that 


dR_, oy 
7 == Y(N-R-S). 


Second, observe that 


_—_—__— 8 ooo So -CO 


dR dR/dt yl pp’ 


Hence, S(R)= S,e~*/? and 
dR 
dt 


Equation (5) is separable, but cannot be solved explicitly. However, if the 
epidemic is not very large, then R/p is small and we can truncate the 
Taylor series 


=y(N—R-Sye*/?). (5) 


eon R45() +... 


after three terms. With this approximation, 


qa N-R- S,[1-R/p+ 3(R/o) || 


So (8) 
=y| N—S,+{— —-1]R-—I-— 
Y 0 (- 9) 0 


The solution of this equation is 


R()= | = —1+atanb( 5on—9)| (6) 
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where 
1/2 


S 
¢=tanh7!4 | 
2 a\ p 


Ss 2 2S,(N-S 
(=-1) + o ( o) 
p 


and the hyperbolic tangent function tanhz is defined by 


2 


It is easily verified that 


4 tanhz =sech?z = —_4__ 
dz (e? +e”) 
Hence, 
202 
an = © sech?( 5 av! - 9), (7) 
0 


Equation (7) defines a symmetric bell shaped curve in the t-dR/dt plane 
(see Figure 2). This curve is called the epidemic curve of the disease. It 
illustrates very well the common observation that in many actual epidem- 
ics, the number of new cases reported each day climbs to a peak value and 
then dies away again. 


gR 
dt 


2¢/aY 
Figure 2 


Kermack and McKendrick compared the values predicted for dR/dt 
from (7) with data from an actual plague in Bombay which spanned the 
last half of 1905 and the first half of 1906. They set 


a = 890 sech?(0.2/—3.4) 

with ¢ measured in weeks, and compared these values with the number of 
deaths per week from the plague. This quantity is a very good approxima- 
tion of dR/dt, since almost all cases terminated fatally. As can be seen 
from Figure 3, there is excellent agreement between the actual values of 
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DEATHS 


5 10 15 20 25 30 
WEEKS 
Figure 3 


dR /dt, denoted by , and the values predicted by (7). This indicates, of 
course, that the system of differential equations (1) is an accurate and reli- 
able model of the spread of an infectious disease within a population of 
fixed size. 
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EXERCISES 
1. Derive Equation (6). 


2. Suppose that the members of (S) are vaccinated against the disease at a rate A 
proportional to their number. Then, 


aS _ _ +ST-)S, a =rSI— yl. (*) 


(a) Find the orbits of (*). 
(b) Conclude from (a) that S(¢) approaches zero as t approaches infinity, for ev- 
ery solution S(t), I(t) of (*). 
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. Suppose that the members of (S) are vaccinated against the disease at a rate A 
proportional to the product of their numbers and the square of the members of 
(7). Then, 
dS = — _ 2 dI _ _ * 
a rSI—-ASI*, i I(rS— y). (*) 
(a) Find the orbits of (*). 


(b) Will any susceptibles remain after the disease dies out? 


. The intensity i of an epidemic is the proportion of the total number of suscept- 
ibles that finally contracts the disease. Show that 


— Ipt So— Sa 
i= a 
where S,, is a root of the equation 

S= Sye'>~ So 40)/e, 


. Compute the intensity of the epidemic if p= 1000, J>=10, and (a) Sp=1100, (b) 
So = 1200, (c) Sgo= 1300, (d) Sp= 1500, (e) Sp= 1800, (f) So= 1900. (This cannot 
be done analytically.) 


. Let R,, denote the total number of individuals who contract the disease. 

(a) Show that R,,=Jp+ So— S,- 

(b) Let R, denote the members of (R) who are removed from the population 
prior to the peak of the epidemic. Compute R,/R,, for each of the values of 
So in S5a—Sf. Notice that most of the removals occur after the peak. This type 
of asymmetry is often found in actual notifications of infectious diseases. 


. It was observed in London during the early 1900’s, that large outbreaks of 
measles epidemics recurred about once every two years. The mathematical biol- 
ogist H. E. Soper tried to explain this phenomenon by assuming that the stock 
of susceptibles is constantly replenished by new recruits to the population. Thus, 
he assumed that 


aS = — FSI +p, a =rSI-yI (*) 


for some positive constants r, y, and p. 

(a) Show that S=y/r, /=p/y is the only equilibrium solution of (*). 

(b) Show that every solution S(t), 7(t) of (*) which starts sufficiently close to 
this equilibrium point must ultimately approach it as ¢ approaches infinity. 

(c) It can be shown that every solution S(t), 7(t) of (*) approaches the 
equilibrium solution S=y/r, 1=p/y as t approaches infinity. Conclude, 
therefore, that the system (*) does not predict recurrent outbreaks of measles 
epidemics. Rather, it predicts that the disease will ultimately approach a 
steady state. 


4.13 A model for the spread of gonorrhea 


Gonorrhea ranks first today among reportable communicable diseases in 
the United States. There are more reported cases of gonorrhea every year 
than the combined totals for syphilis, measles, mumps, and infectious 
hepatitis. Public health officials estimate that more than 2,500,000 Ameri- 
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cans contract gonorrhea every year. This painful and dangerous disease, 
which is caused by the gonococcus germ, is spread from person to person 
by sexual contact. A few days after the infection there is usually itching 
and burning of the genital area, particularly while urinating. About the 
same time a discharge develops which males will notice, but which females 
may not notice. Infected women may have no easily recognizable symp- 
toms, even while the disease does substantial internal damage. Gonorrhea 
can only be cured by antibiotics (usually penicillin). However, treatment 
must be given early if the disease is to be stopped from doing serious 
damage to the body. If untreated, gonorrhea can result in blindness, steril- 
ity, arthritis, heart failure, and ultimately, death. 

In this section we construct a mathematical model of the spread of 
gonorrhea. Our work is greatly simplified by the fact that the incubation 
period of gonorrhea is very short (3-7 days) compared to the often quite 
long period of active infectiousness. Thus, we will assume in our model 
that an individual becomes infective immediately after contracting gonor- 
rhea. In addition, gonorrhea does not confer even partial immunity to 
those individuals who have recovered from it. Immediately after recovery, 
an individual is again susceptible. Thus, we can split the sexually active 
and promiscuous portion of the population into two groups, susceptibles 
and infectives. Let c,(t) be the total number of promiscuous males, c,(t) 
the total number of promiscuous females, x(t) the total number of infec- 
tive males, and y(t) the total number of infective females, at time ft. Then, 
the total numbers of susceptible males and susceptible females are c,(t) — 
x(t) and c,(t)—y(t) respectively. The spread of gonorrhea is presumed to 
be governed by the following rules: 

1. Male infectives are cured at a rate a, proportional to their total num- 
ber, and female infectives are cured at a rate a, proportional to their total 
number. The constant a, is larger than a, since infective males quickly de- 
velop painful symptoms and therefore seek prompt medical attention. 
Female infectives, on the other hand, are usually asymptomatic, and there- 
fore are infectious for much longer periods. 

2. New infectives are added to the male population at a rate b, propor- 
tional to the total number of male susceptibles and female infectives. Simi- 
larly, new infectives are added to the female population at a rate b, pro- 
portional to the total number of female susceptibles and male infectives. 

3. The total numbers of promiscuous males and promiscuous females re- 
main at constant levels c, and c,, respectively. 

It follows immediately from rules 1-3 that 


d 
PT aUxt bi (ce, x) 


(1) 


dy 
mw. 2 + by(c,—y)x. 
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Remark. The system of equations (1) treats only those cases of gonorrhea 
which arise from heterosexual contacts; the case of homosexual contacts 
(assuming no interaction between heterosexuals and homosexuals) is 
treated in Exercises 5 and 6. The number of cases of gonorrhea which arise 
from homosexual encounters is a small percentage of the total number of 
incidents of gonorrhea. Interestingly enough, this situation is completely 
reversed in the case of syphilis. Indeed, more than 90% of all cases of 
syphilis reported in the state of Rhode Island during 1973 resulted from 
homosexual encounters. (This statistic is not as startling as it first appears. 
Within ten to ninety days after being infected with syphilis, an individual 
usually develops a chancre sore at the spot where the germs entered the 
body. A homosexual who contracts syphilis as a result of anal intercourse 
with an infective will develop a chancre sore on his rectum. This individ- 
ual, naturally, will be reluctant to seek medical attention, since he will then 
have to reveal his identity as a homosexual. Moreover, he feels no sense of 
urgency, since the chancre sore is usually painless and disappears after 
several days. With gonorrhea, on the other hand, the symptoms are so 
painful and unmistakable that a homosexual will seek prompt medical 
attention. Moreover, he need not reveal his identity as a homosexual since 
the symptoms of gonorrhea appear in the genital area.) 


Our first step in analyzing the system of differential equations (1) is to 
show that they are realistic. Specifically, we must show that x(t) and y(t) 
can never become negative, and can never exceed c, and c,, respectively. 
This is the content of Lemmas | and 2. 


Lemma 1. Jf x(t) and y(t) are positive, then x(t) and y(t) are positive for 
all t > to. 


Lemma 2. If x({o) is less than c, and y(to) is less than c,, then x(t) is less 
than c, and y(t) is less than c, for all t > to. 


PROOF OF LEMMA |. Suppose that Lemma | is false. Let t* > ft, be the first 
time at which either x or y is zero. Assume that x is zero first. Then, 
evaluating the first equation of (1) at r=2r* gives x(t*)=b,c, y(t*). This 
quantity is positive. (Note that y(t*) cannot equal zero since x =0, y =0 is 
an equilibrium solution of (1).) Hence, x(t) is less than zero for ¢ close to, 
and less than ¢*. But this contradicts our assumption that ¢* is the first 
time at which x(t) equals zero. We run into the same contradiction if y(t*) 
=0. Thus, both x(#) and y(¢) are positive for t > fo. 0 


PROOF OF LEMMA 2. Suppose that Lemma 2 is false. Let ¢* >t, be the first 
time at which either x=c,, or y=c. Suppose that x(¢*)=c,. Evaluating 
the first equation of (1) at = 7¢* gives x(t*)= — a,c,. This quantity is nega- 
tive. Hence, x(t) is greater than c, for ¢ close to, and less than ¢*. But this 
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contradicts our assumption that /* is the first time at which x(t) equals c;. 
We run into the same contradiction if y(t*)=c,. Thus, x(¢) is less than c, 
and y(f) 1s less than c, for ¢ > fo. OC 


Having shown that the system of equations (1) is a realistic model of 
gonorrhea, we now see what predictions it makes concerning the future 
course of this disease. Will gonorrhea continue to spread rapidly and un- 
controllably as the data in Figure 1 seems to suggest, or will it level off 
eventually? The following extremely important theorem of epidemiology 
provides the answer to this question. 


Theorem 8. 
(a) Suppose that a,a, is less than b,byc,c,. Then, every solution x(t), 
y(t) of (1) with O< x(to)< c, and 0< y(to) < c¢, approaches the 
equilibrium solution 
~ a,b, +b,b,c,’ 7 aod, +b,byc, 


as t approaches infinity. In other words, the total numbers of infective 
males and infective females will ultimately level off. 

(b) Suppose that a,a, is greater than b,b,c,c. Then every solution x(t), 
y(t) of (1) with O< x(t) <c,; and 0<y(to)<c,, approaches zero as t ap- 
proaches infinity. In other words, gonorrhea will ultimately die out. 


Our first step in proving part (a) of Theorem 8 is to split the rectangle 
O<x<c,, O0<y<c, into regions in which both dx/dt and dy/dt have 
fixed signs. This is accomplished in the following manner. Setting dx /dt= 
0 in (1), and solving for y as a function of x gives 


a,x 


y= ——————- =, (x). 
b(aq—xy 
Similarly, setting dy /dt=0 in (1) gives 
ayy b5C4X 


x=—————,, or y= ——— = ¢@, (x). 


Observe first that ¢,(x) and ¢,(x) are monotonic increasing functions of x; 
$,(x) approaches infinity as x approaches c,, and $,(x) approaches c, as x 
approaches infinity. Second, observe that the curves y =¢,(x) and y =¢,(x) 
intersect at (0,0) and at (Xo, yo) where 


_— b,byc,C, — aa, y= b,b5c,c,— a, a, 
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Figure 2 


Third, observe that $,(x) is increasing faster than $,(x) at x =0, since 
, bc» ay , 
2 (0) = a "Bey = ¢; (0). 


Hence, ¢$,(x) lies above $,(x) for 0< x <x, and ¢,(x) lies below ¢,(x) for 
Xg<x<c,, as shown in Figure 2. The point (Xp, yo) is an equilibrium point 
of (1) since both dx /dt and dy/dt are zero when x= x, and y= yo. 

Finally, observe that dx/dt is positive at any point (x,y) above the 
curve y =¢$,(x), and negative at any point (x,y) below this curve. Similarly, 
dy / dt is positive at any point (x,y) below the curve y =¢,(x), and negative 
at any point (x,y) above this curve. Thus, the curves y=¢,(x) and y= 
$,(x) split the rectangle O0< x<c,, O<y<c, into four regions in which 
dx / dt and dy /dt have fixed signs (see Figure 2). 

Next, we require the following four simple lemmas. 


Lemma 3. Any solution x(t), y(t) of (1) which starts in region | at time t= ty 
will remain in this region for all future time t 2 tg and approach the 
equilibrium solution x = Xo, Y=) as t approaches infinity. 


PRooF. Suppose that a solution x(f), y(t) of (1) leaves region I at time t= 
t*. Then, either x(¢*) or y(t*) is zero, since the only way a solution of (1) 
can leave region I is by crossing the curve y=¢,(x) or y=o,(x). Assume 
that x(¢*)=0. Differentiating both sides of the first equation of (1) with re- 
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spect to ¢ and setting t= ¢* gives 


ce oy (ey— x(t) 2. 


This quantity is positive, since x(z*) is less than c,, and dy /dt is positive on 
the curve y=9,(x), O0<x< x9. Hence, x(t) has a minimum at ¢=/*. But 
this is impossible, since x(t) is increasing whenever the solution x(t), y(t) is 
in region I. Similarly, if y(t*)=0, then 


d*y(t*) 
dt? 


=b,(c Co—y(t t*)) ae d 


This quantity is positive, since y(t*) is less than c,, and dx /dt is positive on 
the curve y=9¢,(x), 0<x< x. Hence, y(t) has a minimum at r=/¢*. But 
this is impossible, since y(¢) is increasing whenever the solution x(‘¢), y(Z) is 
in region I. 

The previous argument shows that any solution x(‘),y(t) of (1) which 
starts in region I at time ¢= f) will remain in region I for all future time ¢ > 
t). This implies that x(7) and y(¢) are monotonic increasing functions of 
time for ¢ > f, with x(7)< x, and y(t)<yo. Consequently, by Lemma | of 
Section 4.8, both x(¢) and y(t) have limits £,n, respectively, as t approaches 
infinity. Lemma 2 of Section 4.8 implies that (€,7) is an equilibrium point 
of (1). Now, it is easily seen from Figure 2 that the only equilibrium points 
of (1) are (0,0) and (xo, yo). But (€,7) cannot equal (0,0) since both x(t) 
and y(f) are increasing functions of time. Hence, (&,7)=(Xp, yo), and this 
proves Lemma 3. L] 


Lemma 4. Any solution x(t), y(t) of (1) which starts in region III at time t= 
ty will remain in this region for all future time and ultimately approach the 
equilibrium solution x = Xo, Y =Yo- 


ProoF. Exactly the same as Lemma 3 (see Exercise 1). O 


Lemma 5. Any solution x(t), y(t) of (1) which starts in region II at time t= 
ty, and remains in region II for all future time, must approach the 
equilibrium solution x = Xo, Y =o as t approaches infinity. 


ProoF. If a solution x(t), y(¢) of (1) remains in region II for t> tp, then 
x(t) is monotonic decreasing and y(t) is monotonic increasing for ¢ > fo. 
Moreover, x(t) is positive and y(Z) is less than c,, for t > to. Consequently, 
by Lemma | of Section 4.8, both x(t) and y(t) have limits &,n respectively, 
as ¢ approaches infinity. Lemma 2 of Section 4.8 implies that (€,7) is an 
equilibrium point of (1). Now, (&,7) cannot equal (0,0) since y(f) is increas- 
ing for ¢ > tp. Therefore, (£1) =(x, Yo), and this proves Lemma 5. O 


471 


4 Qualitative theory of differential equations 


Lemma 6. Any solution x(t), y(t) of (1) which starts in region IV at time t= 
ty and remains in region IV for all future time, must approach the 
equilibrium solution x = Xo, y=) as t approaches infinity. 


Proor. Exactly the same as Lemma 5 (see Exercise 2). OC 


We are now in a position to prove Theorem 8. 


PROOF OF THEOREM 8. (a) Lemmas 3 and 4 state that every solution x(a), 
y(t) of (1) which starts in region I or III at time t= 4) must approach the 
equilibrium solution x =x), y=yo as ¢t approaches infinity. Similarly, 
Lemmas 5 and 6 state that every solution x(t), y(t) of (1) which starts in 
region II or IV and which remains in these regions for all future time, must 
also approach the equilibrium solution x = x9, y=) . Now, observe that if 
a solution x(t), y(t) of (1) leaves region II or IV, then it must cross the 
curve y=,(x) or y=@,(x), and immediately afterwards enter region I or 
region III. Consequently, all solutions x(t), y(t) of (1) which start in re- 
gions II and IV or on the curves y=¢,(x) and y=¢,(x), must also ap- 
proach the equilibrium solution x(t) = xo, y(t) =o. 0 


(b) PRooF #1. If a,a, is greater than b,b,c,c,, then the curves y=¢,(x) 
and y =¢,(x) have the form described in Figure 3 below. In region I, 
dx / dt is positive and dy /dt is negative; in region II, both dx /dt and dy /dt 
are negative; and in region III, dx /dt is negative and dy /dt is positive. It 
is a simple matter to show (see Exercise 3) that every solution x(t), y(t) of 
(1) which starts in region II at time ¢=¢) must remain in this region for all 


y= (x) 


Figure 3 
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future time, and approach the equilibrium solution x =0, y=0 as ¢ ap- 
proaches infinity. It is also trivial to show that every solution x(t), y(2) of 
(1) which starts in region I or region III at time ¢=f, must cross the curve 
y=6,(x) or y=¢,(x), and immediately afterwards enter region II (see Ex- 
ercise 4). Consequently, every solution x(t), y(t) of (1), with 0< x(t%)<c, 
and 0< y(t ))<c), approaches the equilibrium solution x =0, y=0 as ¢ ap- 
proaches infinity. C] 


PROOF #2. We would now like to show how we can use the Poincaré- 
Bendixson theorem to give an elegant proof of part (b) of Theorem 8. Ob- 
serve that the system of differential equations (1) can be written in the 


form 
d(x —a, dic, \ x bxy 
— = —_ . 2 
£)-(n. MEV5)-(3°3] Q) 
Thus, by Theorem 2 of Section 4.3, the stability of the solution x =0, y =0 


of (2) is determined by the stability of the equilibrium solution x =0, y =0 
of the linearized system 


d(x x —a, bc, \y x 
OA )A(ne “\G) 
The characteristic polynomial of the matrix A is 
A? +(a,+a,)A+ a,a,— b,b,c,¢, 
whose roots are 
5 1/2 
~ (a, + ay) + | (a, + a) —4(a,a,— bb ¢,¢2) | 
2 
It is easily verified that both these roots are real and negative. Hence, the 
equilibrium solution x =0, y =0 of (2) is asymptotically stable. This implies 
that any solution x(t), y(t) of (1) which starts sufficiently close to the 
origin x =y =O will approach the origin as ¢ approaches infinity. Now, 
suppose that a solution x(f), y(¢) of (1), with O< x(t)<c, and 0<y(%)< 
C,, does not approach the origin as ¢ approaches infinity. By the previous 
remark, this solution must always remain a minimum distance from the 
origin. Consequently, its orbit for ¢ > fy lies in a bounded region in the x — 
y plane which contains no equilibrium points of (1). By the Poincaré- 
Bendixson Theorem, therefore, its orbit must spiral into the orbit of a peri- 
odic solution of (1). But the system of differential equations (1) has no 
periodic solution in the first quadrant x >0, y 20. This follows im- 
mediately from Exercise 11, Section 4.8, and the fact that 


9 0 
ax bx + Oi (e1—*) 9] + y | — any + by(c,—y)x | 
=—(a,+a,+5,y+bx) 


is strictly negative if both x and y are nonnegative. Consequently, every 
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solution x(t), y(t) of (1), with O< x(t)<c, and 0< y (Lo) <cy approaches 
the equilibrium solution x =0,y =0 as ¢ approaches infinity. 0 


Now, it is quite difficult to evaluate the coefficients a,, aj, b,, 0, c,, and 
C>. Indeed, it is impossible to obtain even a crude estimate of a,, which 
should be interpreted as the average amount of time that a female remains 
infective. (Similarly, a, should be interpreted as the average amount of 
time that a male remains infective.) This is because most females do not 
exhibit symptoms. Thus, a female can be infective for an amount of time 
varying from just one day to well over a year. Nevertheless, it is still possi- 
ble to ascertain from public health data that a,a, 1s less than b,b,c,c,, as 
we now show. Observe that the condition a,a,< b,b,c,c, 1s equivalent to 


bic b5c 
1<()(2 
ay ay 


The quantity b,c,/a, can be interpreted as the average number of males 
that one female infective contacts during her infectious period, if every 
male is susceptible. Similarly, the quantity b,c,/a, can be interpreted as 
the average number of females that one male infective contacts during his 
infectious period, if every female is susceptible. The quantities b,c,/a, and 
b,c,/a, are called the maximal female and male contact rates, respectively. 
Theorem 8 can now be interpreted in the following manner. 


(a) If the product of the maximal male and female contact rates is greater 
than one, then gonorrhea will approach a nonzero steady state. 

(b) If the product of the maximal male and female contact rates is less 
than one, then gonorrhea will die out eventually. 


In 1973, the average number of female contacts named by a male infec- 
tive during his period of infectiousness was 0.98, while the average number 
of male contacts named by a female infective during her period of infec- 
tiousness was 1.15. These numbers are very good approximations of the 
maximal male and female contact rates, respectively, and their product 
does not exceed the product of the maximal male and female contact rates. 
(The number of contacts of a male or female infective during their period 
of infectiousness is slightly less than the maximal male or female contact 
rates. However, the actual number of contacts is often greater than the 
number of contacts named by an infective.) The product of 1.15 with 0.98 
is 1.0682. Thus, gonorrhea will ultimately approach a nonzero steady state. 


Remark. Our model of gonorrhea is rather crude since it lumps all prom- 
iscuous males and all promiscuous females together, regardless of age. A 
more accurate model can be obtained by separating the male and female 
populations into different age groups and then computing the rate of 
change of infectives in each age group. This has been done recently, but 
the analysis is too difficult to present here. We just mention that a result 
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completely analogous to Theorem 8 is obtained: either gonorrhea dies out 
in each age group, or it approaches a constant, positive level in each age 


group. 
EXERCISES 


In Problems 1 and 2, we assume that a,a,< b,b5c Cp. 


1. (a) Suppose that a solution x(t), y(2) of (1) leaves region III of Figure 2 at time 
t=t* by crossing the curve y =¢,(x) or y =,(x). Conclude that either x(t) 
or y(t) has a maximum at t=f*. Then, show that this is impossible. Con- 
clude, therefore, that any solution x(t), y(t) of (1) which starts in region III 
at time ¢= 7%) must remain in region III for all future time ¢> fo. 

(b) Conclude from (a) that any solution x(7¢), y(t) of (1) which starts in region 
III has a limit £,7 as ¢ approaches infinity. Then, show that (€,7) must equal 


(Xo; Yo): 


2. Suppose that a solution x(#), y(t) of (1) remains in region IV of Figure 2 for all 
time ¢ > fp. Prove that x(7) and y(#) have limits £,7 respectively, as t approaches 
infinity. Then conclude that (£7) must equal (x9, yo). 


In Problems 3 and 4, we assume that a,a, > b,bc,c>. 


3. Suppose that a solution x(Z), y(t) of (1) leaves region II of Figure 3 at time t= 7* 
by crossing the curve y=¢,(x) or y=¢,(x). Show that either x(t) or y(t) has a 
maximum at ¢=¢*. Then, show that this is impossible. Conclude, therefore, that 
every solution x(¢), y(¢) of (1) which starts in region II at time ¢= ¢) must remain 
in region II for all future time ¢ > fo. 


4. (a) Suppose that a solution x(¢), y(t) of (1) remains in either region I or III of 
Figure 3 for all time ¢> %. Show that x(t) and y(Z) have limits &,y respec- 
tively, as ¢ approaches infinity. 

(b) Conclude from Lemma 1 of Section 4.8 that (€ 4) =(0,0). 

(c) Show that (£,7n) cannot equal (0,0) if x(#), y(t) remains in region I or region 
III for all time ¢ > fo. 

(d) Show that any solution x(¢),y(t) of (1) which starts on either y=¢,(x) or y= 
$2(x) will immediately afterwards enter region II. 


5. Assume that a,a) < b,byc,c>. Prove directly, using Theorem 2 of Section 4.3, that 
the equilibrium solution x= x9, y=yo of (1) is asymptotically stable. Warning: 
The calculations are extremely tedious. 


6. Assume that the number of homosexuals remains constant in time. Call this con- 
stant c. Let x(t) denote the number of homosexuals who have gonorrhea at time 
t. Assume that homosexuals are cured of gonorrhea at a rate a;, and that new 
infectives are added at a rate B,(c— x)x. 

(a) Show that x= —a,x+ B,x(c— x). 
(b) What happens to x(t) as ¢ approaches infinity? 

7. Suppose that the number of homosexuals c(t) grows according to the logistic 
law c=c(a— bc), for some positive constants a and b. Let x(#) denote the num- 
ber of homosexuals who have gonorrhea at time ¢, and assume (see Problem 6) 
that x= — a,x+ B,x(c— x). What happens to x(f) as t approaches infinity? 
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5.1 Two point boundary-value problems 


In the applications which we will study in this chapter, we will be con- 
fronted with the following problem. 


Problem: For which values of A can we find nontrivial functions y(x) 
which satisfy 


—+dAy=0; = ay(0)+ by’(0)=0,_ cy(/) + dy’(/) = 0? (1) 


Equation (1) is called a boundary-value problem, since we prescribe infor- 
mation about the solution y(x) and its derivative y’(x) at two distinct 
points, x =O and x=/. In an initial-value problem, on the other hand, we 
prescribe the value of y and its derivative at a single point x = Xo. 

Our intuitive feeling, at this point, is that the boundary-value problem 
(1) has nontrivial solutions y(x) only for certain exceptional values A. To 
wit, y(x)=0 is certainly one solution of (1), and the existence—uniqueness 
theorem for second-order linear equations would seem to imply that a 
solution y(x) of y” +Ay =0 is determined uniquely once we prescribe two 
additional pieces of information. Let us test our intuition on the following 
simple, but extremely important example. 


Example 1. For which values of \ does the boundary-value problem 


ay 


dx? 
have nontrivial solutions? 


+Ay=0;  -y(0)=0, y(/)=0 (2) 


476 


5.1 Two point boundary-value problems 


Solution. 

(i) A=0. Every solution y(x) of the differential equation y” =0 is of the 
form y(x)=c,x+ >, for some choice of constants c, and c,. The condition 
y(0)=0 implies that c, =0, and the condition y(/) =0 then implies that c, = 
0. Thus, y(x)=0 is the only solution of the boundary-value problem (2), 
for \=0. 

(ii) A<0: In this case, every solution y(x) of y” +Ay =0 is of the form 
y(x)=ce’ > *+c,e7 VA *, for some choice of constants c, and c,. The 
boundary conditions y(0)=y(/)=0 imply that 


Cc) +c,=0, ev Alo te7 YA", =0. (3) 


The system of equations (3) has a nonzero solution c,,c, if, and only if, 


I I _VrAI VEX 
{er va) —e =0. 


This implies that ev ~*!=e-Y~*" or e?¥-4/=1. But this is impossible, 


since e” is greater than one “for z>0. Hence, c,=c,=0 and the boundary- 
value problem (2) has no nontrivial solutions y(x) when AJ is negative. 

(iii) A>0: In this case, every solution y(x) of y” +Ay =0 is of the form 
y(x)=c,cosVA x+c,sinVA x, for some choice of constants c, and c,. 
The condition y(0)=0 implies that c,=0, and the condition y(/)=0 then 
implies that c,sinVA /=0. This equation is satisfied, for any choice of c,, 
if VA /=nz, or \=n?q’/1?, for some positive integer n. Hence, the 
boundary-value problem (2) has nontrivial solutions y(x)=csinnmx /1 for 
A=n?a?/1?, n=1,2,.... 


Remark. Our calculations for the case 4<0 can be simplified if we write 
every solution y(x) in the form y=c,coshV —A x+c,sinhV —A x, where 


V-Ax —-V-Ax 
coshV —A x= oe 


and 
sinhV —A x= —{—___& 


The condition y(0)=0 implies that c,=0, and the condition y(/)=0 then 
implies that c,sinhV —A /=0. But sinhz is positive for z >0. Hence, c, =0, 
and y(x)=0. 


Example | is indicative of the general boundary-value problem (1). In- 


deed, we have the following remarkable theorem which we state, but do 
not prove. 
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Theorem 1. The boundary-value problem (1) has nontrivial solutions y(x) 
only for a denumerable set of values \,,\,..., where \; <X,..., and d, ap- 
proaches infinity as n approaches infinity. These special values of X are 
called eigenvalues of (1), and the nontrivial solutions y(x) are called eigen- 
functions of (1). In this terminology, the eigenvalues of (2) are 
a? / 17,40? / 17,927 /1*,..., and the eigenfunctions of (2) are all constant 
multiples of sinax /l,sin2ax/l,.... 


There is a very natural explanation of why we use the terms eigenvalue 
and eigenfunction in this context. Let V be the set of all functions y(x) 
which have two continuous derivatives and which satisfy ay(0)+ by’(0)= 
0,cy(/) + dy’(1)=0. Clearly, V is a vector space, of infinite dimension. 
Consider now the linear operator, or transformation L, defined by the 
equation 

d*y 
Ly \(x)= —- —> (x). 4 
[ Ly (x) ree ) (4) 
The solutions y(x) of (1) are those functions y in V for which Ly =Ay. That 
is to say, the solutions y(x) of (1) are exactly those functions y in V which 
are transformed by L into multiples A of themselves. 


Example 2. Find the eigenvalues and eigenfunctions of the boundary-value 
problem 


d2 
a tw=0;  yO)+y'O)=0, y(1)=0. (5) 
Solution. 

(1) A=0. Every solution y(x) of y” =0 is of the form y(x)=c,x+ 0c), for 
some choice of constants c, and c,. The conditions y (0) + y’(0) =0 and y(1) 
=(Q both imply that c,= —c,. Hence, y(x)=c(x—1), c¥0, is a nontrivial 
solution of (5) when A=0; i.e., y(x)=c(x— 1), c40, is an eigenfunction of 
(5) with eigenvalue zero. 

(11) A<O. In this case, every solution y(x) of y” +Ay=0 is of the form 
y(x)=c,coshV —A x+c,sinhV —A x, for some choice of constants c, and 
c,. The boundary conditions y(0)+y’(0)=0 and y(1)=0 imply that 


c+ V-A c,=0, coshV —A c,+sinhV —A c,=0. (6) 


(Observe that (cosh x)’=sinhx and (sinh x)’=coshx.) The system of equa- 
tions (6) has a nontrivial solution c,,c, if, and only if, 


ct l vA |=sinnV=K ~V-—) coshV —A =0. 
coshV —-A sinh V —A 


This implies that 

sinhV -A =V—AcoshV—-A. (7) 
But Equation (7) has no solution \ <0. To see this, let z= V —A , and con- 
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sider the function h(z)=zcoshz— sinhz. This function is zero for z=0 and 
is positive for z >0, since its derivative 


h'(z)=coshz + z sinhz —coshz =z sinhz 


is strictly positive for z>0. Hence, no negative number A can satisfy (7). 

(ii) A>0. In this case, every solution y(x) of y” +Ay =0 is of the form 
y(x)=c¢,cosVA x +c,sinVA x, for some choice of constants c, and c¢. 
The boundary conditions imply that 


¢.+VAc,=0, cosVAc,tsinVA c,=0. (8) 


The system of equations (8) has a nontrivial solution c,,c, if, and only if, 


cet | VA |=sinvx —VX cosVA =0. 
cosVAsinVA 


This implies that 
tanVA =V). (9) 


To find those values of \ which satisfy (9), we set = VA and draw the 
graphs of the functions y= and ny =tan€ in the —7 plane (see Figure 1); 
the € coordinate of each point of intersection of these curves is then a root 
of the equation €=tané. It is clear that these curves intersect exactly once 
in the interval 7/2 <£<37/2, and this occurs at a point €, >. Similarly, 
these two curves intersect exactly once in the interval 37/2<&€<52/2, and 
this occurs at a point , >27. More generally, the curves n={ and n= tang 
intersect exactly once in the interval 


2n—1)7 2n+1)7 
Gna tt, nti 
2 2 
and this occurs at a point & >a. 


7) 
| | N= tan & 


Figure 1. Graphs of 7 =é and n= tang 
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Finally, the curves n= and n= tan§€ do not intersect in the interval 0< 
€<2/2. To prove this, set h(€)=tan€—& and compute 
h’() =sec*é— 1=tané. 


This quantity is positive for O0<&< 7/2. Consequently, the eigenvalues of 
(5) are A, =&?,A,=é3,..., and the eigenfunction of (5) are all constant mul- 


tiples of the functions — VA, cosVA, x+sinVA, x,-— VA, cosVA, x 
+ sin VAG x,.... We cannot compute A, exactly. Nevertheless, we know 
that 


nea? <r, <(2n+1)n?/4. 


In addition, it is clear that A, approaches (2n + 1)’2?/4 as n approaches in- 
finity. 


EXERCISES 


Find the eigenvalues and eigenfunctions of each of the following 
boundary-value problems. 


lL. y”+Ay=0; = y(O)=0, y(/)=0 

-y"+hy=0; y'(0)=0, y'(/)=0 

y"—dy=0;  y'(O)=0, y'()=0 

-y"+hy=0; y(0)=0, y()=0 

-y"+hy=0;  y(0)=0, y(7)—y(7)=0 

-y"thy=0; y(0)—y'O=0, y()=0 

yy" +hy=0;  — y(0)—y’(0)=0,  y(7)—y'(7) =0 

. For which values of A does the boundary-value problem 
y”—2y’+(1+A)y=0; = y(0)=0, y(1)=0 


have a nontrivial solution? 


eo NN KH Ss WS N 


9. For which values of \ does the boundary-value problem 


y"+dy=0; yy (0)=y(27), y'(0)=y'(27) 
have a nontrivial solution? 


10. Consider the boundary-value problem 


y"t+hy =f); y(0)=0, y()=0 (*) 
(a) Show that (*) has a unique solution y(#) if A is not an eigenvalue of the ho- 
mogeneous problem. 
(b) Show that (*) may have no solution y(f) if A is an eigenvalue of the homo- 
geneous problem. 
(c) Let A be an eigenvalue of the homogeneous problem. Determine conditions 
on f so that (*) has a solution y(¢). Is this solution unique? 
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5.2 Introduction to partial differential equations 


Up to this point, the differential equations that we have studied have all 
been relations involving one or more functions of a single variable, and 
their derivatives. In this sense, these differential equations are ordinary dif- 
ferential equations. On the other hand, many important problems in ap- 
plied mathematics give rise to partial differential equations. A partial dif- 
ferential equation is a relation involving one or more functions of several 
variables, and their partial derivatives. For example, the equation 


is a partial differential equation for the function u(x,t), and the equations 


du _ dv du_ dv 


ax dy’ dy ax 

are a system of partial differential equations for the two functions u(x,y) 
and v(x,y). The order of a partial differential equation is the order of the 
highest partial derivative that appears in the equation. For example, the 
order of the partial differential equation 


au _5 0% 
or2 dxdt 


+u 


is two, since the order of the highest partial derivative that appears in this 
equation is two. 

There are three classical partial differential equations of order two 
which appear quite often in applications, and which dominate the theory 
of partial differential equations. These equations are 


ou _ 207u 1 
at aa” (1) 
07u 5 07u 
Otu _ 2d°u 2) 
at? Ox? ( 
and 
2 2 
d*u, dUu_p (3) 
dx? dy” 


Equation (1) is known as the heat equation, and it appears in the study 
of heat conduction and other diffusion processes. For example, consider a 
thin metal bar of length / whose surface is insulated. Let u(x,t) denote the 
temperature in the bar at the point x at time ¢. This function satisfies the 
partial differential equation (1) for 0<.x</. The constant a* is known as 
the thermal diffusivity of the bar, and it depends solely on the material 
from which the bar is made. 
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Equation (2) is known as the wave equation, and it appears in the study 
of acoustic waves, water waves and electromagnetic waves. Some form of 
this equation, or a generalization of it, almost invariably arises in any 
mathematical analysis of phenomena involving the propagation of waves 
in a continuous medium. (We will gain some insight into why this is so in 
Section 5.7.) The wave equation also appears in the study of mechanical 
vibrations. Suppose, for example, that an elastic string of length /, such as 
a violin string or guy wire, is set in motion so that it vibrates in a vertical 
plane. Let u(x,7) denote the vertical displacement of the string at the point 
x at time ¢ (see Figure 1). If all damping effects, such as air resistance, are 
negligible, and if the amplitude of the motion is not too large, then u(x, 7) 
will satisfy the partial differential equation (2) on the interval 0< x </. In 
this case, the constant c? is H/p, where H is the horizontal component of 
the tension in the string, and p is the mass per unit length of the string. 


u(x.t) 


Figure 1 


Equation (3) is known as Laplace’s equation, and is the most famous of 
all partial differential equations. It arises in the study of such diverse ap- 
plications as steady state heat flow, vibrating membranes, and electric and 
gravitational potentials. For this reason, Laplace’s equation is often re- 
ferred to as the potential equation. 

In addition to the differential equation (1), (2), or (3), we will often im- 
pose initial and boundary conditions on the function u. These conditions 
will be dictated to us by the physical and biological problems themselves; 
they will be chosen so as to guarantee that our equation has a unique solu- 
tion. 

As a model case for the heat equation (1), we consider a thin metal bar 
of length / whose sides are insulated, and we let u(x,t) denote the tempera- 
ture in the bar at the point x at time ¢. In order to dermine the temperature 
in the bar at any time ¢ we need to know (i) the initial temperature distrib- 
ution in the bar, and (ii) what is happening at the ends of the bar. Are they 
held at constant temperatures, say 0°C, or are they insulated, so that no 
heat can pass through them? (This latter condition implies that u,(0,/)= 
u,(/,t)=0.) Thus, a “well posed” problem for diffusion processes is the 
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heat equation (1), together with the initial condition u(x,0)= f(x), O0<x< 
/, and the boundary conditions u(0,?)=u(/,7)=0, or u,(0,t)=u,(1,0) =0. 

As a model case for the wave equation, we consider an elastic string of 
length /, whose ends are fixed, and which is set in motion in a vertical 
plane. In order to determine the position u(x,7) of the string at any time ¢ 
we need to know (1) the initial position of the string, and (ii) the initial 
velocity of the string. It is also implicit that u(0,7)=u(/,t)=0. Thus, a well 
posed problem for wave propagation is the differential equation (2) 
together with the initial conditions u(x,0)= f(x), u,(x,0)= g(x), and the 
boundary conditions u(0, t) = u(/, 2) =0. 

The partial differential equation (3) does not contain the time ¢, so that 
we do not expect any “initial conditions” to be imposed here. In the prob- 
lems that arise in applications, we are given u, or its normal derivative, on 
the boundary of a given region R, and we seek to determine u(x,y) inside 
R. The problem of finding a solution of Laplace’s equation which takes on 
given boundary values is known as a Dirichlet problem, while the problem 
of finding a solution of Laplace’s equation whose normal derivative takes 
on given boundary values is known as a Neumann problem. 

In Section 5.3 we will develop a very powerful method, known as the 
method of separation of variables, for solving the boundary-value problem 


(strictly speaking, we should say “initial boundary-value problem”) 
ou = 2 dou ; u(x,0)=f(x), O<x<l; u(0,t)=u(l,t)=0. 
é dx? 


After developing the theory of Fourier series in Sections 5.4 and 5.5, we 
will show that the method of separation of variables can also be used to 
solve more general problems of heat conduction, and several important 
problems of wave propagation and potential theory. 


5.3 The heat equation; separation of variables 


Consider the boundary-value problem 


Ou _ 2 0°u 0) = 0 I; u(0,t)=u(i,t)=0. (1 
Ware gai M4 D=S(x), O<x<h uON=u(L)=0. (1) 


Our goal is to find the solution u(x, ?) of (1). To this end, it is helpful to 
recall how we solved the initial-value problem 


2 
y dy 
We tp(t)7-+a()y=0;  yO)=ro y'(0)=yo- (2) 
First we showed that the differential equation 
d’y 
+ p(OF +4()y=0 (3) 
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is linear; that is, any linear combination of solutions of (3) is again a solu- 
tion of (3). And then, we found the solution y(¢) of (2) by taking an ap- 
propriate linear combination c, y,(t)+c,y,(t) of two linearly independent 
solutions y,(¢) and y,(t) of (3). Now, it is easily verified that any linear 
combination c,u,(x,A)+...+c,u,(x,?) of solutions u,(x,t),...,u,(x, ft) of 


du _ 2d°u 
of dx? (4) 
is again a solution of (4). In addition, if u,(x,7),...,u,(x,2) satisfy the 


boundary conditions u(0,t)=u(/,t)=0, then the linear combination c,u, 
+...+0c,u, also satisfies these boundary conditions. This suggests the 
following “game plan” for solving the boundary-value problem (1): 

(a) Find as many solutions u,(x,1?),u,(x,t),... aS we can of the 
boundary-value problem 


Ou _ 29°u. _ _ 
ay = a2» u(0,t)=u(/,t) =0. (5) 

(b) Find the solution u(x, ¢) of (1) by taking an appropriate linear com- 
bination of the functions u,(x,f), n=1,2,.... 

(a) Since we don’t know, as yet, how to solve any partial differential 
equations, we must reduce the problem of solving (5) to that of solving one 
or more ordinary differential equations. This is accomplished by setting 
u(x,t)=X (x)T(t) (hence the name separation of variables). Computing 
d7u 


—=xXT’ and Tz HX"T 
Ox 


we see that u(x,t)= X(x)T(f) is a solution of the equation u, = a*u,, 


(u,=du/dt and u,,=0*u/ dx’) if 


XT'=a°X"T. (6) 
Dividing both sides of (6) by a*XT gives 
XxX” T’ 

xX" _ iT 7 

XY 2 (7) 


Now, observe that the left-hand side of (7) is a function of x alone, while 
the right-hand side of (7) is a function of t alone. This implies that 

xX" Ta 

Y= A, and oF v (8) 
for some constant A. (The only way that a function of x can equal a func- 
tion of ¢ is if both are constant. To convince yourself of this, let f(x)=g2(2) 
and fix fy. Then, f(x)= g(¢) for all x, so that f(x)=constant=c,, and this 
immediately implies that g(t) also equals c,.) In addition, the boundary 
conditions 


0= u(0,t)=X (0)T (0), 
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and 
O=u(l,t)=X (1)T(t) 


imply that X (0)=0 and X(/)=0 (otherwise, u must be identically zero). 
Thus, u(x, t)= X (x)T (ft) is a solution of (5) if 


X”"+rAX =0; X(0)=0, X(/)=0 (9) 
and 
T’+da?T =0. (10) 


At this point, the constant A is arbitrary. However, we know from Example 
1 of Section 5.1 that the boundary-value problem (9) has a nontrivial solu- 
tion X (x) only if \=A, =n?77//?, n=1,2,...; and in this case, 

n1rXx 


X (x) =X, (x) =sin—. 


Equation (10), in turn, implies that 


T(t)=T,(t)=e7e er”, 
(Actually, we should multiply both X,(x) and T,(t) by constants; however, 
we omit these constants here since we will soon be taking linear combina- 
tions of the functions X,(x)T,,(¢).) Hence, 

u,, (x,t) =sin = eet /E 
is a nontrivial solution of (5) for every positive integer n. 

(b) Suppose that f(x) is a finite linear combination of the functions 

sinnax /1; that is, 


f(x)= > C, sin 


n=1 


Then, 
u(x,t)= > c, sin —— — en ennt/l 


is the desired solution of (1), since it is a linear combination of solutions of 
(5), and it satisfies the initial condition 


u(x, n= So, sin = f(x), O0<x<il. 


n=1 


Unfortunately, though, most functions f(x) cannot be expanded as a finite 
linear combination of the functions sinn7x//, n=1,2,..., on the interval 
0<x</. This leads us to ask the following question. 


Question: Can an arbitrary function f(x) be written as an infinite linear 
combination of the functions sinn7x //, n=1,2,..., on the interval 0< x< 
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/? In other words, given an arbitrary function f, can we find constants 
C1,C>,..., Such that 


(o @) 
f(x) =e, sin +e,sin 22 +... = ¢, sin ; 0<x<l? 


Remarkably, the answer to this question is yes, as we show in Section 5.5. 


Example 1. At time t=0, the temperature u(x,0) in a thin copper rod (a? 
= 1.14) of length one is 2sin37x +5sin87x, 0< x <1. The ends of the rod 
are packed in ice, so as to maintain them at 0°C. Find the temperature 
u(x,t) in the rod at any time ¢>0. 

Solution. The temperature u(x, ¢) satisfies the boundary-value problem 


Ou =| 140-u u(x,0)=2sin 32x +5sin87x, 0<x<l 
Ot x? u(0,t)=u(1,t)=0 
and this implies that 
u(x,t) =2sin 32x e794)" 4-5 sin Barx eA, 
EXERCISES 
Find a solution u(x,t) of the following problems. 
, du 17) 9°%. u(x,0)=sinax/2+3sinSax/2, 0<x<2 
" Of Ax?’ | (0,4) =u(2,t) =0 
, du =| 14.04. u(x,0)=sinax/2—3sin2ax, O0<x<2 
" Ot x2” u(0,t)=u(2,t)=0 


3. Use the method of separation of variables to solve the boundary-value problem 


Ou _ O*u , u(x,0)=3sin27x—7sin4dax, O0<x<10 
df ax?” | u(0, 1) =u (10,4) =0 


Use the method of separation of variables to solve each of the following 
boundary-value problems. 


Ou _ Ou, = py —2y 

4. at ay? u(0,y) =e” +e 
Ou _ ou, —p-—3t 2t 

6 Ou _ ou. u(0,y) =2e~” — e” 

“of oy” 

7. Ou _ ou _ u(t 0)=e 7% +4+2e7"— 14e!3! 
ot oy °° , 
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8. Determine whether the method of separation of variables can be used to re- 
place each of the following partial differential equations by pairs of ordinary 
differential equations. If so, find the equations. 

(a) tu,,+u, =0 (b) tu, + xu, =0 
(c) Uy, +(x —y)u,, =0 (d) Uy, +2u,, + u, =O 


9. The heat equation in two space dimensions 1s 
— a2 
u, = a (u,,+u,,). (*) 


(a) Assuming that u(x,y,t)=X (x) Y(y)T(d), find ordinary differential equa- 
tions satisfied by X, Y, and T. 

(b) Find solutions u(x,y,t) of (*) which satisfy the boundary conditions 
u(0,y,t)=0, u(a,y,t)=0, u(x,0,)=0, and u(x, b,1)=0. 


10. The heat equation in two space dimensions may be expressed in terms of polar 
coordinates as 


= 2 Tia t 
U, = ar) u,, + put 2 Ugo 
Assuming that u(r,9,t)= R(r)O(@)T (2), find ordinary differential equations 
satisfied by R, ©, and 7. 


5.4 Fourier series 


On December 21, 1807, an engineer named Joseph Fourier announced to 
the prestigious French Academy of Sciences that an arbitrary function f(x) 
could be expanded in an infinite series of sines and cosines. Specifically, let 
f(x) be defined on the interval —/< x </, and compute the numbers 


=f f(x)cos™™* dx, n=0,1,2,... (1) 
and 
/ 
b= 7 f f(x)sin “=~ dx, n=1,2,... (2) 
—! 
Then, the infinite series 
“0 + a, cos ~ +b, sin +. ere) a, cos =* +5, sin (3) 
207 ! l 2 2| if l | 


converges to f(x). Fourier’s announcement caused a loud furor in the 
Academy. Many of its prominent members, including the famous mathe- 
matician Lagrange, thought this result to be pure nonsense, since at that 
time it could not be placed on a rigorous foundation. However, mathema- 
ticians have now developed the theory of “Fourier series” to such an extent 
that whole volumes have been written on it. (Just recently, in fact, they 
have succeeded in establishing exceedingly sharp conditions for the Four- 
ler series (3) to converge. This result ranks as one of the great mathemati- 
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cal theorems of the twentieth century.) The following theorem, while not 
the most general theorem possible, covers most of the situations that arise 
in applications. 


Theorem 2. Let f and f’ be piecewise continuous on the interval —l<x <l. 
(This means that f and f’ have only a finite number of discontinuities on 
this interval, and both f and f’ have right- and left-hand limits at each 
point of discontinuity.) Compute the numbers a, and b,, from (1) and (2) 
and form the infinite series (3). This series, which is called the Fourier 
series for f on the interval —1< x <1, converges to f(x) if f is continuous 
at x, and to 3[ f(x+0)+f(x—0)]* if f is discontinuous at x. Atx= +1, 
the Fourier series (3) converges to 3{f(J)+f(—J], where f(+1) is the 
limit of f(x) as x approaches +1. 


Remark. The quantity 3[f(x+0)+ f(x —0)] is the average of the right- 
and left-hand limits of f at the point x. If we define f(x) to be the average 
of the right- and left-hand limits of f at any point of discontinuity x, then 
the Fourier series (3) converges to f(x) for all points x in the interval —/< 
x<il. 


Example 1. Let f be the, function which is 0 for —1< x<OQand | forO0<x 
< 1. Compute the Fourier series for f on the interval -l<x<l. 
Solution. In this problem, /=1. Hence, from (1) and (2), 


ay= fi f(x)de= fi dv=l, 


l 
a,=f f(x)cosnmxdx = [ cos nx dx =0, n> 
-] 0 
and 


. 1 
b= J flx)sinamxdx= [ sin nx dx 


1-(-1)" 
= | (1 ~cosnmy= n>. 
nw nT 


Notice that b, =0 for n even, and b,=2/nm for n odd. Hence, the Fourier 
series for f on the interval —1<x< 1 is 


1 . 2sin7x , 2sin3ax , 2sin5ax 


~ + ey gt .... 

2 T 397 Sa 
By Theorem 2, this series converges to 0 if —1<x<0, and to 1 if O<x<l. 
At x=—1, 0, and +1, this series reduces to the single number 5, which is 


the value predicted for it by Theorem 2. 


*The quantity f(x +0) denotes the limit from the right of f at the point x. Similarly, f(x —0) 
denotes the limit of f from the left. 
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Example 2. Let f be the function which is 1 for —2< x <0 and x for0< x 
<2. Compute the Fourier series for f on the interval —2 < x <2. 
Solution. In this problem /=2. Hence from (1) and (2), 


1? 10° nal Prag 
wal fom a fara hem? 


2 2 
a= { f(x)eos™ dx= 4 fr cos Fe dx + 9 [ xcos™™ dx 
2J/_, 2 2 Jo 2 


2 2 
= s(cosnmm—1), nol 
and 
u MUX dyed (a M™ gat nax 
b= 5 “S(e)sin 5 a= f sin) dx+ 5 xsin—, dx 
=—— > 1. 
14 +cosn7), n>1l 
Notice that a, =0 if n is even; a,=—4/n’2? if n is odd; b, =0 if n is odd; 
and b, = —2/nm if n is even. Hence, the Fourier series for f on the interval 
—2<x<2 is 
4 mx 1. 4 30x 1. 
]— 72 8 7 ~ 7 Sinax 9 7 COS 7, Sin2ax +... 


=|— 


“. cos(2n + 1)ax /2 ~ 
4 ( yn / = siInn1x . (4) 
T n=0 (2n+1) a n= 


By Theorem 2, this series converges to 1 if —2<x<0; to x, if0<x<2; to 
+ if x=0; and to $ if x+2. Now, at x=0, the Fourier series (4) is 


4/1 l l l 
1-4) 3 toatatat. | 
Thus, we deduce the remarkable identity 


1_, 4/i,1 7,1 7,1 
2! Slatete pe | 
Or 
1,1,1,1 _@ 
pip gp pr 8 


The Fourier coefficients a, and b, defined by (1) and (2) can be derived 
in a simple manner. Indeed, if a piecewise continuous function f can be ex- 
panded in a series of sines and cosines on the interval —/< x < /, then, of 
necessity, this series must be the Fourier series (3). We prove this in the 


489 


5 Separation of variables and Fourier series 


following manner. Suppose that f is piecewise continuous, and that 


f(xaet > | eeeos + q,sin | (5) 


for some numbers c, and d,. ‘Equation (5) is assumed to hold at all but a 
finite number of points in the interval —/< x </. Integrating both sides of 


l 
(5) between —/ and / gives col= [ f(x)dx, since 


l 


Similarly, multiplying both sides of (5) by cosm7x// and integrating be- 
tween —/ and / gives 


l l 
J cosB¥ a= f sin” ax = =0: k=1,2....* 
| —| 


ley= fi f(x) cos dx 


while multiplying both sides of (5) by sinnzx// and integrating between 
~—/ and / gives 


ld,= =f f(x) sin dx. 


This follows immediately from the relations (see Exercise 19) 


l 
NIX oog ktx p10, k#n 
ik 7 COS dx + ken (6) 
'onwx . karx 
[ cos ——— sin——— dx =0 (7) 
- l l 
and 
/ 
NIX i ROX _|0, k#n 
[ sint hs j j dx 1? ken’ (8) 


Hence, the coefficients c, and d, must equal the Fourier coefficients a, and 
b,. In particular, therefore, a function f can be expanded in one, and only 
one, Fourier series on the interval —/<x</. 


Example 3. Find the Fourier series for the function f(x) =cos”x on the in- 
terval —-7w7<x<7. 
Solution. By the preceding remark, the function f(x) =cos’x has a unique 
Fourier series 

“0 


a < ~ 
> + S | a,cos™ + b, sin“ |= x + 2 [ a, cosnx + b, sinnx | 


*It can be shown that it is permissible to integrate the series (5) term by term. 
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on the interval —7<x <7. But we already know that 


, 1 _ cos2x 
cos’ x = = + ———_.. 
2 2 
Hence, the Fourier series for cos?x on the interval —1<x<a must be 
++ 45cos2x. 


The functions cosn7x// and sinn7x /1, n=1,2,... all have the interest- 
ing property that they are periodic with period 2/; that is, they repeat 
themselves over every interval of length 2/. This follows trivially from the 
identities 


nw _ nTX ene LUX 
cos = (x+2/) = cos( — +2nm} cos — 
and 
nT _ oi ( ATX = in NOX 
sin = (x+2/)= sin( — + 2ner] sin. 


Hence, the Fourier series (3) converges for all x to a periodic function 
F(x). This function is called the periodic extension of f(x). It is defined by 
the equations 


F(x)=f(x) —-Il<x<l 
F(x)=3[f(O+f(-D], x= 4! 
F(x+21)=F(x). 

For example, the periodic extension of the function f(x)= x is described in 


Figure 1, and the periodic extension of the function f(x)=|x| is the saw- 
toothed function described in Figure 2. 


Figure 1. Periodic extension of f(x)=x 


-42 -31 -22 


Figure 2. Periodic extension of f(x)=|x| 
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EXERCISES 


In each of Problems 1-13, find the Fourier series for the given function f 
on the prescribed interval. 


_/-l, -l<x<0. 
1 f= | 1 O<x<l’ Ix] <1 


2. f(x={* —2<x<0. \x| <2 


O0<x <2’ 
—_y _{/—-x, ~-l<x<0. 
3. f(x)=x; -1<x<l 4. f(x) % TOS*ST: llc! 
l -—-2<x<0 
5. f(x)=}0, O<x<l; |x|<2 
l, l<x<2 
_/0, -—2<x<l. 
6S) (3 l<xe2? PIS? 
_| 0, —-l<x<0. < 
7 $0) (es O<x<l’ lx </ 
= e~, —l<x<0. </l 
8. £0) 0, O<x<l’ | 
_ e*, —-l<x<0. < </ 
9. f(x) on 0<x<l’ XS 
10. f(x)=e*; |x| </ IL. f(x)=e™*; |x| </ 
12. f(x)=sin?x; |x|<a 13. f(x)=sin?x; |x| <a 


14. Let f(x) =(acosax)/2asinaz, a not an integer. 
(a) Find the Fourier series for f on the interval -7<x<-z7. 
(b) Show that this series converges at x =7 to the value (7 /2a)cotza. 
(c) Use this result to sum the series 


l l 1 l 


—_——. + +.... 
12— a2 22 — q? 32 — q2 


15. Suppose that f and f’ are piecewise continuous on the interval —/ <x </. Show 
that the Fourier coefficients a, and b, approach zero as n approaches infinity. 


16. Let 


f= Z+ S | 4, cos“7* +b, sin 2 |. 


n=] 


Show that 
ae 2 
Tf Podar= z+ D (+07). 
~ n=1 


This relation is known as Parseval’s identity. Hint: Square the Fourier series 
for f and integrate term by term. 
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17. (a) Find the Fourier series for the function f(x)= x’ on the interval -7<x< 
7. 


(b) Use Parseval’s identity to show that 
at 5a t gat = 90° 


18. If the Dirac delta function 6(x) had a Fourier series on the interval —:< x <l, 
what would it be? 


19. Derive Equations (6)-(8). Hint: Use the trigonometric identities 
sin A cos B= 1[sin(A + B)+sin(A — B)] 
sin A sin B= 1[cos(A — B) —cos(A + B)] 
cos Acos B= }[cos(A + B)+cos(A — B)]. 


5.5 Even and odd functions 


There are certain special cases when the Fourier series of a function f re- 
duces to a pure cosine or a pure sine series. These special cases occur when 
f is even or odd. 


Definition. A function f is said to be even if f(— x)=f (x). 

Example 1. The function f(x)= x? is even since 
f(-x)= (x) =x? =f(x). 

Example 2. The function f(x)=cosnzx// is even since 


f(—x)=cos = =cos7> = f(x). 


Definition. A function f is said to be odd if f(— x)= — f(x). 


Example 3. The function f(x)=x is odd since 


f(-x)=—x=-F(2) 


Example 4. The function f(x)=sinnzx// is odd since 


f(—x)=sin — = -sin = = — f(x). 


Even and odd functions satisfy the following elementary properties. 


1. The product of two even functions is even. 
2. The product of two odd functions is even. 
3. The product of an odd function with an even function is odd. 


The proofs of these assertions are trivial and follow immediately from the 
definitions. For example, let f and g be odd and let h(x)=/(x) g(x). This 
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function h is even since 


h(— x)=f(—x)8(—x)=[—S(x)][ —8(*) ] =f) a(x) =A). 
In addition to the multiplicative properties 1-3, even and odd functions 
satisfy the following integral properties. 


4, The integral of an odd function f over a symmetric interval [—/,/] is 
zero; that is, [ : f(x)dx=0 if f is odd. 


5. The integral of an even function f over the interval [—/,/] is twice the 
integral of f over the interval [0,/]; that is, 


I / 
f f(x)dx=2 [ f(x)dx 
—] 0 
if f is even. 
PROOF OF PROPERTY 4. If f is odd, then the area under the curve of f be- 
tween —/ and 0 is the negative of the area under the curve of f between 0 


and I. Hence, f ' f(x)dx =0 if f is odd. 0 
—/ 


PROOF OF PROPERTY 5. If f is even, then the area under the curve of f be- 
tween —/ and 0 equals the area under the curve of f between 0 and /. 
Hence, 


I 0 ] j 
[ fedde= [Up ojact ['fedx=2 [sade 
if f is even. 2 


Concerning even and odd functions, we have the following important 
lemma. 


Lemma 1. 
(a) The Fourier series for an even function is a pure cosine series; that 
is, it contains no terms of the form sinntx /I. 
(b) The Fourier series for an odd function is a pure sine series; that is, 
it contains no terms of the form cosnix / I. 


PROOF. (a) If f is even, then the function f(x)sinn7x// is odd. Thus, by 
Property 4, the coefficients 


b= Tf f(x) sin = dx, n=1,2,3,... 


in the Fourier series for f are all zero. 
(b) If f is odd, then the function f(x)cosnzx// is also odd. Conse- 
quently, by Property 4, the coefficients 


a, = an f(x) cos dr, n=0,1,2,... 


in the Fourier series for f are all zero. C) 
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We are now in a position to prove the following extremely important ex- 
tension of Theorem 2. This theorem will enable us to solve the heat con- 
duction problem of Section 5.3 and many other boundary-value problems 
that arise in applications. 


Theorem 3. Let f and f’ be piecewise continuous on the interval 0< x <1. 
Then, on this interval, f(x) can be expanded in either a pure cosine series 


5+ Sa, cos, 


n=1 


or a pure sine series 


MES 
Ss b, sin —— 


n=] 


In the former case, the coefficients a, are given by the formula 


l 
a,= 5 { f(x)cos™* dr, n=0,1,2,... (1) 
0 


while in the latter case, the coefficients b, are given by the formula 
2! AMX 
b,, = al f(x) sin — dx. (2) 
0 


Proor. Consider first the function 
f(x), 0 
f ( —x ), —l 


The graph of F(x) is described in Figure 1, and it is easily seen that F is 
even. (For this reason, F is called the even extension of f.) Hence, by 


<x<l 
<x<0 


F(s)=| 


f(x) 


Figure 1. Graph of F(x) 
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Lemma 1, the Fourier series for F on the interval —/< x </1s 


F(x)= = + do cos ; a 


n 


(F (x) cos“ dx. (3) 

Now, observe that the function F(x)cosn7x // is even. Thus, by Property 5 
= a= F(x Joos dx = 5 2 if f(x) cos“ de. 

Finally, since F(x)= f(x), 0<x </, we conclude from (3) that 


f= P+ Do cos*, O0<x<il. 


Observe too, that the series 6) converges to f(x) for x=0 and x=/. 
To show that f(x) can also be expanded in a pure sine series, we con- 
sider the function 


f(x), 0<x<l 
G(x)=} —f(-—x), —-l<x<0 
0, x=O0,+1. 
f(x) 
2 2 x 


Figure 2. Graph of G(x) 


The graph of G(x) is described in Figure 2, and it is easily seen that G is 
odd. (For this reason, G is called the odd extension of f.) Hence, by 
Lemma 1, the Fourier series for G on the interval —/< x </is 


G(x)= Ds, sin; = an G(x) sin dx. (4) 
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Now, observe that the function G(x) sinnzx// is even. Thus, by Property 
5, 


=F [ G(a)sin™ dea [ S0o)sin "Fa 


Finally, since G(x)=f(x),0< x </, we conclude from (4) that 


f(x)= > osin O<x<i. 


Observe too, that the series (4) is zero for x =0 and x =/. O 


Example 5. Expand the function f(x)=1 in a pure sine series on the inter- 
val O<x<z. 


Solution. By Theorem 3, f(x)= >) ~_,b,sinnx, where 


n=l" 


> pn 7 0, mneven 
b,= = [ sinnx dx = —(1—cosn7)=, 4, nodd. 
7 Jo nv — 


nn 
Hence, 


j=4 sinx , sin3x , sindx, O<x<n. 
T| | 3 5 


Example 6. Expand the function f(x) =e” in a pure cosine series on the in- 
terval0<x<l. 


Solution. By Theorem 3, f(x)=a)/2+ >, ~_,a,cosnmx, where 


n=] a, 


l 
ap=2[ e*dx=2(e-1) 


and 
1 1 
=2]/ e*cosnmxdx=2Re| ee!" dx 
J i 
1 ; l+inw __ 
=2Re [ e(l+imsdx=2Re| —— | 
0 1+ ina 
(ecosna — 1)(1— inz) 2(ecosnz — 1) 
=?Re —_—_— eee = —_———————— 
l+n?x2 1+n*7? 
Hence, 


ecosni — | 
era e—142 5 (econ) +9 = cos O<x<l. 


n=} 
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EXERCISES 


Expand each of the following functions in a Fourier cosine series on the 
prescribed interval. 


1. f(x)=e7*; O<x<l 2. f= {> ex 853 O<x<2 


-~J/x, O<x<a, 
3. F(x) {* a<cx<2a’ O<x<2a 


4. f(x)=cos?x; O<x<a 
_ x, O<x</1/2. 
8. fxy={ )_* Wr<x<p O<*s'! 


Expand each of the following functions in a Fourier sine series on the pre- 
scribed interval. 


=—p-*: _/0, O<x<l. 

6. f(x)se*;  Ocxcl 7. f(x) {? ISESb3 O<x<2 
~!/xX, O<x<a., 

8. f(x) {* we S5qi | OS® <2a 

9. f(x) =2sinx cos x; 0<x<a 
_ x, O<x<l/2. 

10. f(x)={ ,_* Wr<x<l 0<x</l 

11. (a) Expand the function f(x)=sinx in a Fourier cosine series on the interval 
O<cx<a. 

(b) Expand the function f(x)=cos.x in a Fourier sine series on the interval 0< 

Xx<T%. 


(c) Can you expand the function f(x)=sinx in a Fourier cosine series on the 
interval — a7 < x < 7? Explain. 


5.6 Return to the heat equation 

We return now to the boundary-value problem 
Ou _ y29°u. u(x,0)=f(x) (1) 
of Ox? u(0,t)=u(I,t)=0 

We showed in Section 5.3 that the function 


nN7X ean an t/P 


io 2) 
u(x,t)= > Ch sin 
n=} 


is (formally) a solution of the boundary-value problem 


Ou _ 29m. = _ 
a =O ay? u(0,t)=u(/,t)=0 (2) 
for any choice of constants c,,c,,.... This led us to ask whether we can 
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find constants c,,c,,... such that 
ioe) 
u(x,0)= > ¢,sin™* = f(x), O<x<i. (3) 
=] 


As we showed in Section 5.5, the answer to this question is yes; if we 
choose 


=F f(x) sin ard 


then the Fourier series >} wn 1on sinnax /1 converges to f(x) if f is continu- 
ous at the point x. Thus, 


v 


u(x,t)= 


> | f f(x) in ain ae an mme/P (4) 


n=] 
is the desired solution of (1). 


Remark. Strictly speaking, the solution (4) cannot be regarded as the solu- 
tion of (1) until we rigorously justify all the limiting processes involved. 
Specifically, we must verify that the function u(x,t) defined by (4) actually 
has partial derivatives with respect to x and ¢, and that u(x,7?) satisfies the 
heat equation u,= a7u,.. (It is not true, necessarily, that an infinite sum of 
solutions of a linear differential equation is again a solution. Indeed, an 
infinite sum of solutions of a given differential equation need not even be 
differentiable.) However, in the case of (4) it is possible to show (see Ex- 
ercise 3) that u(x,t) has partial derivatives with respect to x and ¢ of all 
orders, and that u(x, 7) satisfies the boundary-value problem (1). The argu- 
ment rests heavily upon the fact that the infinite series (4) converges very 
rapidly, due to the presence of the factor e~*””‘/”. Indeed, the function 
u(x,t), for fixed t>0, is even analytic for 0<x</. Thus, heat conduction 
is a diffusive process which instantly smooths out any discontinuities that 
may be present in the initial temperature distribution in the rod. Finally, 
we observe that lim, _,,,u(x,7)=0, for all x, regardless of the initial temper- 
ature in the rod. This is in accord with our physical intuition that the heat 
distribution in the rod should ultimately approach a “steady state”; that is, 
a state in which the temperature does not change with time. 


Example 1. A thin aluminum bar (a*=0.86 cm’/s) 10 cm long is heated to 
a uniform temperature of 100°C. At time t=0, the ends of the bar are 
plunged into an ice bath at 0°C, and thereafter they are maintained at this 
temperature. No heat is allowed to escape through the lateral surface of the 
bar. Find an expression for the temperature at any point in the bar at any 
later time ¢. 
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Solution. Let u(x,t) denote the temperature in the bar at the point x at 
time ¢. This function satisfies the boundary-value problem 


u(x,0)=100, O<x<10 
u(0,t)=u(10,t)=0 


Ou O7u 


4 0.9624; | (5) 


dt dx2” 
The solution of (5) is 


i ~0.86n? 2? 
u(x,t) = > c, sin “2X g — 0.86n?n?1/100 


where 


_ 1 /7'? _ mmx , 200 ,, _ 
C, = 5 100 sin 10 dx = om —(1-—cosnz). 


Notice that c, =0 if n is even, and c, =400/nz if n is odd. Hence, 


oo sin(2n+ 1) [0 
un="* 2 Gay ~~ Qntl) 


n=] 


—0.86(2n + 1)*2/100 


There are several other problems of heat conduction which can be 
solved by the method of separation of variables. Example 2 below treats 
the case where the ends of the bar are also insulated, and Exercise 4 treats 
the case where the ends of the bar are kept at constant, but nonzero tem- 
peratures 7, and 7). 


Example 2. Consider a thin metal rod of length / and thermal diffusivity 

a’, whose sides and ends are insulated so that there is no passage of heat 

through them. Let the initial temperature distribution in the rod be f(x). 

Find the temperature distribution in the rod at any later time ¢. 

Solution. Let u(x,t) denote the temperature in the rod at the point x at 

time ¢. This function satisfies the boundary-value problem 
du = got. u(x,0)=f(x), O<x<l 
‘or Ox? u,(0,t) =u, (1,t) =0 

We solve this problem in two steps. First, we will find infinitely many solu- 

tions u,,(x,t)=X,(x)T,(2) of the boundary-value problem 


u,=a°U,,; u,.(0,¢) = u,(/, t) =0, (7) 


and then we will find constants ¢o,c,,C>,... such that 


W(x,t)= S cgay (2) 
n=0 


(6) 


satisfies the initial condition u(x,0)= f(x). 
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Step 1: Let u(x,t)=X (x)T (2). Computing 


du a*u 


a =XT’ and 52 =X"T 
we see that u(x,7/) is a solution of u,=a7u,, if 
bw xX” T’ 
XT’=a°X"T, or y= — (8) 


As we showed in Section 5.3, Equation (8) implies that 
X"+dA\X=0 and T’+dAa?T=0 
for some constant A. In addition, the boundary conditions 
O=u,(0,t)=X'(0)T(t) and O0=u,(Lt)=X'U)T (2) 
imply that X’(0)=0 and X’(/)=0. Hence u(x, ?t)= X (x)T(2) 1s a solution 
of (7) if 
X"+XAX =0; X’(0)=0, X’(/)=0 (9) 
and 
T’+da°T =0. (10) 


At this point, the constant A is arbitrary. However, the boundary-value 
problem ) has a nontrivial solution X (x) (see Exercise 1, Section 5.1) 
only if A=n’n?/1?, n=0,1,2,..., and in this case 
X (x)= X,, (x) =cos war 
Equation (10), in turn, implies that T(#)=e7*" ‘mt/l Hence, 
u,,(x,t)= cos ee reait/l 


is a solution of (7) for every nonnegative integer n. 
Step 2: Observe that the linear combination 


u(x, )=5 + Ss C, COS—— aa ee reat /P 
n=! 
is a solution (formally) of (7) for every choice of constants cy,c,,C,.... Its 
initial value is 


u(x, => + > Ch cos 


Thus, in order to satisfy the initial condition u(x,0)= f(x) we must choose 
constants Co,C),C,... such that 


fay= $+ Do ost O<x<l. 


In other words, we must expand f in a Fourier cosine series on the interval 
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0<x</. This is precisely the situation in Theorem 3 of Section 5.5, and 
we conclude, therefore, that 


= maf f(x) cos “* dx. 


Hence, 

_ 1 l 2 . NNX NUX on? n?t/P? 11 
u(x,t) j [ f@ax+ j 2 | [reo “ax cos 7 é (11) 
is the desired solution of (6). 


Remark. Observe from (11) that the temperature in the rod ultimately ap- 
proaches the steady state temperature 


rf fax 


This steady state temperature can be interpreted as the “average” of the 
initial temperature distribution in the rod. 


EXERCISES 


1. The ends x=0 and x=10 of a thin aluminum bar (a?=0.86) are kept at 0°C, 
while the surface of the bar is insulated. Find an expression for the temperature 
u(x,t) in the bar if initially 
(a) u(x,0)=70, 0<x<10 
(b) u(x,0)=70cosx, 0<x< 10 

_ { 10x, O<x<5 
(c) u(x, = | 10(10—x), 5<x<10 
= 0, 0<x<3 
(e) u(x) (63, 3<x<10 

2. The ends and sides of a thin copper bar (a= 1.14) of length 2 are insulated so 

that no heat can pass through them. Find the temperature u(x, 7) in the bar if 


initially 
(a) u(x,0)=6Scos*7x, O<x<2 
(b) u(x,0)=70sinx,  O<x<2 


_ { 60x, 0<x<l 
(c) u(x,0)= | 60(2— x), l< x <2 
_} 0, O<x<l 
(a) u(x, 0) 15, 1<x<2 
3. Verify that the function u(x,t) defined by (4) satisfies the heat equation. Hint: 
Use the Cauchy ratio test to show that the infinite series (4) can be differenti- 
ated term by term with respect to x and t. 


4... is a solution u(x, 2) 


4. A steady state solution u(x, 7) of the heat equation u,= a 
which does not change with time. 
(a) Show that all steady state solutions of the heat equation are linear functions 


of x; 1e., u(x)=Ax+B. 
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(b) Find a steady state solution of the boundary-value problem 
u,=aru,,; u(0O,t)=T,, u(Ld=T>. 
(c) Solve the heat conduction problem 


u(x,0)=75, O<x<l 


2, . 
Me Oe panan u(1,t)=60 


Hint: Let u(x, t)=0(x)+w(x,t) where v(x) is the steady state solution of 
the boundary-value problem u, = a7u,,; u(0,f)=20, u(1,1)=60. 


5. (a) The ends of a copper rod (a?= 1.14) 10 cm long are maintained at 0°C, 

while the center of the rod is maintained at 100°C by an external heat source. 
Show that the temperature in the rod will ultimately approach a steady state dis- 
tribution regardless of the initial temperature in the rod. Hint: Split this problem 
into two boundary-value problems. 
(b) Assume that the temperature in the rod is at its steady state distribution. At 
time t=0, the external heat source is removed from the center of the rod, and 
placed at the left end of the rod. Find the temperature in the rod at any later 
time ¢. 


6. Solve the boundary-value problem 


u(x,0)=cosx, O<x<!1 


U, =U, + U; 
u(0,t)=0, u(1,t)=0. 


5.7 The wave equation 


We consider now the boundary-value problem 


_ u(x,0)=f(x), u,(x,0)=g(x) 
—— =; (1) 
ot? 0x? u(0,t)=u(,t) =0 

which characterizes the propagation of waves in various media, and the 
mechanical vibrations of an elastic string. This problem, too, can be solved 
by the method of separation of variables. Specifically, we will (a) find solu- 
tions u, (x, t)= X,(x)T,(2) of the boundary-value problem 


07u 5 07u 
— = (*—_;; u(0,t)=u(l,t) =0 2 
Hear (00) =u(h0 @) 


and (b) find the solution u(x, Z) of (1) by taking a suitable linear combina- 
tion of the functions wu, (x, f). 
(a) Let u(x,t)= X (x)T(t). Computing 
2 2 
o*u =XT” and o*u =—=X¥"T 
or? Ox? 


we see that u(x,f)=X (x)T(#) is a solution of the wave equation u,,=c*u,, 
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if XT” =c*X"T, or 
rx" (3) 


Next, we observe that the left-hand side of (3) is a function of ¢ alone, 
while the right-hand side is a function of x alone. This implies that 
T’_ yx" 
eT A 
for some constant A. In addition, the boundary conditions 
O=u(0,t)=X(0)T(t), and O=u(/,t)=X (1)T(t) 


imply that X (0)=0 and X (/)=0. Hence u(x, t)= X (x)T(f#) is a solution of 
(2) if 
X” +AX =0; X (0)= X (/) =0 (4) 
and 
T” +dc?T=0. (5) 
At this point, the constant A is arbitrary. However, the boundary-value 
problem (4) has a nontrivial solution X (x) only if =A, =n?22//’, and in 
this case, 
X (x)=X,, (x)=sin=. 
Equation (5), in turn, implies that 


T(t)= T,, (1) =a, cos 7 +b, sin =e 
Hence, 
u,,(x, j=sin™ | a, cos a + b, sin —— a 


is a nontrivial solution of (2) for every positive integer n, and every pair of 
constants a,,b,. 
(b) The linear combination 


1°) 
u(x,t)= S sin ~ | a, cos + 5, sin 
n=1 


formally satisfies the boundary-value problem (2) and the initial conditions 
(oe) 00 
u(x,0)= > a,sin* and u,(x,0)= <b, sin 
n=] n=] 
Thus, to satisfy the initial conditions u(x,0)=f(x) and u,(x,0)= g(x), we 
must choose the constants a, and b, such that 
oO 


f(x)= a, sin and (x)= > ™o, sin 


n=} n=1 
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on the interval 0<x</. In other words, we must expand the functions 
f(x) and g(x) in Fourier sine series on the interval 0<x</. This is pre- 
cisely the situation in Theorem 3 of Section 5.5, and we conclude, there- 
fore, that 
ns 2 ¢' _ AMX 
sin——dx and 5b, =—— x) sin —— dx. 
=i f(x) sin dx vac J, 8(x)8in “F 

For simplicity, we now restrict ourselves to the case where g(x) 1S zero; 
that is, the string is released with zero initial velocity. In this case the dis- 
placement u(x,7f) of the string at any time ¢>0 is given by the formula 


u(x,t)= Sa, sin“7* co sm a, = = f f(a)sin™ TX d (6) 


n=] 
There is a physical significance to the various terms in (6). Each term rep- 
resents a particular mode in which the string vibrates. The first term (7 = 1) 
represents the first mode of vibration in which the string oscillates about 
its equilibrium position with reaueney 
= loiame_ 
12a 1 = 3] 


This lowest frequency is called the fundamental frequency, or first har- 
monic of the string. Similarly, the nth mode has a frequency 


cycles per second. 


®, = >—- —— =nw, cycles per second 


which is called the nth harmonic of the string. 

In the case of the vibrating string, all the harmonic frequencies are in- 
teger multiples of the fundamental frequency w,. Thus, we have music in 
this case. Of course, if the tension in the string is not large enough, then the 
sound produced will be of such very low frequency that it is not in the aud- 
ible range. As we increase the tension in the string, we increase the 
frequency, and the result is a musical note that can be heard by the human 
ear. 


Justification of solution. We cannot prove directly, as in the case of the heat 
equation, that the function u(x, 7) defined in (6) is a solution of the wave 
equation. Indeed, we cannot even prove directly that the infinite series (6) 
has a partial derivative with respect to t and x. For example, on formally 
computing u,, we obtain that 


NIC - nhamx ._ nact 
u,= — 7 an Sin sin 


and due to the presence of the factor n, this series may not converge. How- 
ever, there is an alternate way to establish the validity of the solution (6). 
At the same time, we will gain additional insight into the structure of 
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u(x,t). Observe first that 


sin 2 cog AE = i sin (x —ct)+sin (x+ ct) 
l l 2 l l 
Next, let F be the odd periodic extension of f on the interval —l<x<J; 
that is, 


f(x), 0<x<l 


—f(-x), —Il<x<0 and F(x+2/)= F(x). 


F(x)= | 
It is easily verified (see Exercise 6) that the Fourier series for F is 


=< ! 
F (x)= > ¢, Sin, = 4 f f(x) sin = dx. 
n=] 0 


Therefore, we can write u(x, f) in the form 
u(x,t)=3[ F(x—ct)+ F(x+ct) | (7) 


and it is now a trivial matter to show that u(x, /) satisfies the wave equa- 
tion if f(x) has two continuous derivatives. 

Equation (7) has the following interpretation. If we plot the graph of the 
function y = F(x— ct) for any fixed t, we see that it is the same as the 
graph of y= F(x), except that it is displaced a distance ct in the positive x 
direction, as shown in Figures la and 1b. Thus, F(x — ct) is a wave which 
travels with velocity c in the positive x direction. Similarly, F(x+ ct) is a 
wave which travels with velocity c in the negative x direction. The number 
c represents the velocity with which a disturbance propagates along the 
string. If a disturbance occurs at the point xo, then it will be felt at the 
point x after a time t=(x — x9)/c has elapsed. Thus, the wave equation, or 
some form of it, characterizes the propagation of waves in a medium where 
disturbances (or signals) travel with a finite, rather than infinite, velocity. 


Figure | 
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EXERCISES 
Solve each of the following boundary-value problems. 


u(x,0)=cosx—1, u,(x,0)=0, O0<x<27 


1. u,=c*u,,; 
MC tox Anan u(2,t)=0 


u(x,0)=0, u,(x,0)=1, 0<x<! 


2. u,=c*u,,; 
MC tix ope, u(1,t)=0 


yp = C7 Ug x, O<x<l, u,(x,0)=0 
3. u ; u(x,0)= l, l<x<2 
u(0, t) =u(3, 7) =0 3-x. exc3 


u(x,0)=xcos7x/2, u,(x,0)=0, 0<x<!l 


4. Uy = C7 U3 
MS © tax ope u(1,t)=0 


5. A string of length 10 ft is raised at the middle to a distance of | ft, and then 
released. Describe the motion of the string, assuming that c?=1. 


6. Let F be the odd periodic extension of f on the interval —/<x</. Show that 
the Fourier series 


lo 2) 
2 / _ NTX _ NWTX 
7 | f-1)sin asin 
a= 


converges to F(x) if F is continuous at x. 


7. Show that the transformation = x — ct, n= x + ct reduces the wave equation to 
the equation u,, =0. Conclude, therefore, that every solution u(x, t) of the wave 
equation is of the form u(x, t)= F(x — ct)+ G(x + ct) for some functions F and 
G. 


8. Show that the solution of the boundary-value problem 


u.=CeUu..: u(x,O)=f(x), u(x,0)=g(x), —l<x<l 
nme u(0,t)=u(I,t)=0 


x+cl 


u(x,t)= 5 LF (x— ct) + F(x+ et) + = f* g(s)ds 


where F is the odd periodic extension of f. 


9, The wave equation in two dimensions is u,,=c?(u,,+ u,,). Find solutions of 
this equation by the method of separation of variables. 


10. Solve the boundary-value problem 


u(x,0)=f(x), u,(x,0)=0, O<x</ 


= Cy, + U5 
MC Mt ope u(/,t)=0 
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5.8 Laplace’s equation 


We consider now Laplace’s equation 


2 2 
dtu, dU _o (1) 
dx? dy? 


As we mentioned in Section 5.2, two important boundary-value problems 
that arise in connection with (1) are the Dirichlet problem and the Neuv- 
mann problem. In a Dirichlet problem we seek a function u(x,y) which 
satisfies Laplace’s equation inside a region R, and which assumes pre- 
scribed values on the boundary of R. In a Neumann problem, we seek a 
function u(x,y) which satisfies Laplace’s equation inside a region R, and 
whose derivative in the direction normal to the boundary of R takes on 
prescribed values. Both of these problems can be solved by the method of 
separation of variables if R is a rectangle. 


Example 1. Find a function u(x,y) which satisfies Laplace’s equation in 
the rectangle 0< x <a, 0<y<b and which also satisfies the boundary 
conditions 


u(x,0)=0, u(x,b)=0 
u(O,y)=0, u(ay)=f(y) 


(2) 


Solution. We solve this problem in two steps. First, we will find functions 
u,,(x,y)= X,(x)Y,,(y) which satisfy the boundary-value problem 


u,, + u,, =0; u(x,0)=0, u(x,b)=0, u(0,y)=0. (3) 


Then, we will find constants c, such that the linear combination 
Lo 2) 
u(x,y)= 2 Calta (,Y) 
n=] 


satisfies the boundary condition u(a,y)=f(y). 
Step 1: Let u(x,y)=X (x) Y(y). Computing u,,=X” Y and u,,=XY”, we 
see that u(x,y)=X (x) Y(y) is a solution of Laplace’s equation if X” Y+ 
XY” =0, or 

Y” xX” 

a a © 


Next, we observe that the left-hand side of (4) is a function of y alone, 
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while the right-hand side is a function of x alone. This implies that 


for some constant A. In addition, the boundary conditions 
O= u(x,0)= YX (x) Y (0), O=u(x,b)=X (x) Y (5), 
0=u(0,y)=X (0) Y(y) 
imply that Y(0)=0, Y(b)=0, and X (0)=0. Hence u(x,y)=XY is a solu- 
tion of (3) if 
Y”+AY=0; Y(0)=0, Y(b)=0 (5) 
and 
X”—-XX =0, X (0) =0. (6) 
At this point the constant A is arbitrary. However, the boundary-value 


problem (5) has a nontrivial solution Y(y) only if \=A, =n?a?/b’, and in 
this case, 


Y(y)= Y, (vy) =sinnay /b. 


Equation (6), in turn, implies that X,(x) is proportional to sinhnax/b. 

(The differential equation X” —(n?2?/b*)X =0 implies that X (x)= 

c,coshnrx /b+c,sinhnrx/b for some choice of constants c,,c,, and the 

initial condition X (0) =0 forces c, to be zero.) We conclude, therefore, that 
u, (x,y) = sinh a sin — 

is a solution of (3) for every positive integer n. 

Step 2: The function 


oo 
. . A 
u(x,y)= » Ch sinh 7 sin > 
n=] 
is a solution (formally) of (3) for every choice of constants c,,c,,.... Its 
value at x =a 1s 


oO 
_ _nma .. aay 
u(a,y)= » C, sinh —~sin——. 


n=1 
Therefore, we must choose the constants c,, such that 
1°.@) 
_ _ 1 mma. my 
f(y)= >, 6 sinh 4 sin, O<y<b. 


n=1 


In other words, we must expand f in a Fourier sine series on the interval 
0<y<b. This is precisely the situation described in Theorem 3 of Section 
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5.5, and we conclude, therefore, that 


C, = — [ f(y)sin—= n=1,2,.... 


b sinh —— az 


Remark. The method of separation of variables can always be used to 
solve the Dirichlet problem for a rectangle R if u is zero on three sides of 
R. We can solve an arbitrary Dirichlet problem for a rectangle R by split- 
ting it up into four problems where uw is zero on three sides of R (see Ex- 
ercises 1—4). 


Example 2. Find a function u(x,y) which satisfies Laplace’s equation in 
the rectangle 0< x <a, 0<y<b, and which also satisfies the boundary 
conditions 


u,(x,0)=0, u,(x,b)=0 


(7) 
u,(0,y)=0, u,(ay)=f(y) 


u, =0 a 


Solution. We attempt to solve this problem in two steps. First, we will find 
functions u,,(x,y)= X,(x) Y,(y) which satisfy the boundary-value problem 


U,, + u,, =0; u,(x,0)=0, u,(x,b)=0, and u,(0,y)=0. (8) 
Then, we will try and find constants c, such that the linear combination 


u(x,y)= > n=olnUn(X,y) satisfies the boundary condition u,(a,y)=f(y). 
Step 1: Set u(x,y)=X (x) Y(y). Then, as in Example 1, 


for some constant A. The boundary conditions 
O=u,(x,0)=X(x)¥'(0), O=u,(x,b)=X (x) ¥'(), 
= u,(0,y)=X'(0) Y(y) 
imply that 
Y’(0)=0, Y’(b)=0 and X’(0)=0. 
Hence, u(x,y)= X (x) Y(y) is a solution of (8) if 
Y”+AY=0; Y’(0)=0, Y’(b)=0 (9) 
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and 
—AX =0; X’(0) =0. (10) 


At this point, the constant A is arbitrary. However, the boundary-value 
problem (9) has a nontrivial solution Y(y) only if A=A,=n?2?/b?,n= 
0,1,2,..., and in this case 


Y(y)=Y, (y) =cosnmy/b. 


Equation (10), in turn, implies that X (x) is proportional to coshnzx/b. 
(The differential equation X” — n*1*X /b?=0 implies that X (x)= 
c,coshnrx/b+c,sinhnrx/b for some choice of constants c,,c,, and the 
boundary condition X ’(0)=0 forces c, to be zero.) We conclude, therefore, 
that 


nv 
u, (x,y) =cosh “2* cos 

b b 
is a solution of (8) for every nonnegative integer n. 


Step 2: The function 


n7x nay 
u(xy)=5 + S Ch cosh — cos 
n=] 
is a solution (formally) of (8) for every choice of constants Cy,c),C>,.... The 


value of u, at x=a is 


Therefore, we must choose the constants c,,c,,..., such that 
~ ny 
_ nu... nwa 
fQ)= > e c, sinh ~~ cos, O0<y<b. (11) 


n=1 


Now, Theorem 3 of Section 5.5 states that we can expand f(y) in the 
cosine series 


00 b 7 
s=h f ‘1o)o+ 3 >| [100005 ay Joos (12) 


on the interval 0< y <b. However, we cannot equate coefficients in (11) 
and (12) since the series (11) has no constant term. Therefore, the condi- 
tion 


b 
J f(y) a=0 
is necessary for this Neumann problem to have a solution. If this is the 


511 


5 Separation of variables and Fourier series 


case, then 
2 


b ny 
C, = —~saa | f(y) cos ~ dy, n>. 
nm sinh —— 0 


Finally, note that cy remains arbitrary, and thus the solution u(x,y) is only 
determined up to an additive constant. This is a property of all Neumann 
problems. 


EXERCISES 


Solve each of the following Dirichlet problems. 


1. u,, tu, =0 , u(x,0)=0, u(x,b)=0 
O<x<a, O0<y<b u(ay)=0, u(Oy)=f(y) 

2. uy, + uy, =0  u(0,y)=0, u(a,y)=0 
O<x<a, O<y<b’ u(x,0)=0,  u(x,b)=f(x) 


Remark. You can do this problem the long way, by separation of vari- 
ables, or you can try something smart, like interchanging x with y and 
using the result of Example 1 in the text. 


3 u,, + u,, =0 u(0,y)=0, u(a,y)=0 
"O<x<a, O<y<b’ u(x,b)=0, u(x,0)=f(x) 

4 u,, +u,,=0 u(x,0)=f(x), u(x,b)=g(x) 
O<x<a, O<y<b u(Oy)=h(y), u(a,y)=k(y) 


Hint: Write u(x,y) as the sum of 4 functions, each of which 1s zero on three 
sides of the rectangle. 


5, ux, + uy, =0 _ u(x,0)=0,  u(x,b)=1 
O<x<a, O<y<b u(O,y)=0, u(ay)=1 
6. Ux, + Uy, =0 _ u(x,b)=0, u(x,0)=1 
O<x<a, O<y<b u(0,y)=0, u(ay)=1 
7, U,, tu, =0 . u(x,0)=1, u(x,b)=1 
O<x<a, O<y<b u(0,y)=0, u(a,y)=1 
8. u,, + u,, =0  u(x,0)=1,  u(x,b)=1 
O0<x<a, 0<y<bd’ u(0,y)=1, u(a,y)=1 
Remark. Think! 
9. Solve the boundary-value problem 
Uxx + by =U _ u(x,0)=0, u(x, 1)=0 
0<x<l, O<y<l’ u(y)=0, u(ly)=y 
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10. (a) For which functions f(y) can we find a solution u(x,y) of the Neumann 
problem 


u,, + u,, =0 u(y) =0, 4, (0,y)=f(y) 
O<x<l, O<y<1’ 4,(%,0)=0, u4,(x,1)=0 
(b) Solve this problem if f(v)=sin2zy. 
11. Laplace’s equation in three dimensions is 
U,, tu, +u,, =0. 


Assuming that u=X (x) Y(y)Z(z), find 3 ordinary differential equations satis- 
fied by X, Y, and Z. 
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Sturm-Liouville boundary 
value problems 


6.1 Introduction 


In Section 5.5 we described the remarkable result that an arbitrary piecewise 
differentiable function f(x) could be expanded in either a pure sine series of 
the form 


~ . Nx 
f(x) = X b, sin — (1) 
or a pure cosine series of the form 
a ~ nx 
fix)= > + D4, 008 (2) 


on the interval 0 < x <1. We were led to the trigonometric functions ap- 
pearing in the series (1) and (2) by considering the 2 point boundary value 
problems 


y"+Ay=0, = y(0) = 0, yp) = 0, (3) 
and 

yt+aAy=0, yO=HO0yY=0. (4) 
Recall that Equations (3) and (4) have nontrivial solutions 


. nnNx nnx 
y,(x) = csin —— and y,(x) = coos, 


22 
| 
eigenvalues, and the corresponding solutions were called eigenfunctions. 


respectively, only if A = 4, = . These special values of A were called 
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There was another instance in our study of differential equations where 
something interesting happened for special values of a parameter 4. To wit, 
in Section 2.8 we studied, either in the text or in the exercises, the four 
differential equations 


y” — 2xy + Ay = 0, (5) 
(1 — x*)y” — 2xy’ + 4A + Dy =0, (6) 
(1 —x?)y” — xy’ + A*y = 0, (7) 
and 
xy” +(1—x)y’+Ay =0. (8) 


Equations (5)—(8) are the famous Hermite, Legendre, Tchebycheff, and 
Laguerre differential equations, respectively. The Legendre, Tchebycheff, and 
Laguerre equations each have a polynomial solution of degree n if 4 = n, 
while the Hermite equation has a polynomial solution of degree n if 2 = 2n. 
These polynomials, when properly normalized, i.e.. when multiplied by a 
suitable constant, are known as the Hermite, Legendre, Tchebycheff, and 
Laguerre polynomials. It turns out, remarkably, that any piecewise differ- 
entiable function f(x) can also be expanded in a series of Hermite, Legendre, 
Tchebycheff, and Laguerre polynomials, on an appropriate interval. 

There is a very pretty theory that ties together not only the trigonometric 
functions and the Hermite, Legendre, Tchebycheff, and Laguerre poly- 
nomials, but also many of the other famous functions of mathematical 
physics, such as the various Bessel functions. This theory is commonly called 
Sturm-Liouville Theory; it has its roots, essentially, in an area of linear 
algebra known as inner product spaces, and it is to this area that we now 
turn our attention. 


6.2 Inner product spaces 


Up to this point, our study of linear algebra in general and linear vector 
spaces in particular was algebraic in nature. We were able to add two vectors 
together and multiply a vector by a constant. By means of these operations 
we can define the geometric concepts of dimension, line, plane, and even 
parallelism of lines. Recall that the dimension of a space V is the number of 
elements in a basis, i.e., the fewest number of linearly independent vectors that 
span V. Once we have the concept of dimension, we can define a line in V as 
a subspace of dimension 1, a plane as a subspace of dimension 2, etc. Finally, 
two vectors are parallel if one is a constant multiple of the other. 

Many important geometric concepts of so-called Euclidean geometry, 
however, still cannot be defined for arbitrary linear vector spaces. Specifically, 
we have no way of formulating, as yet, the definition of length of a vector 
and the angle between two vectors. To accomplish this, we need to super- 
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impose on V some additional structure. This structure is known as an inner 
product. 

Our definition of an inner product is modelled after the traditional dot 
product of two vectors in R* and R°, which we studied in calculus. Recall that 
if x and y are vectors in R? or R°, then 


X°Y =X, + X2y2 in R? (1) 
and 
X'Y = Xiyy + X2y2 + X3y3 in R* (2) 


where X,, X2,..., and y,, y>,... are the components of x and y, respectively. 
Recall too, the famous identity, proven in most calculus courses, that 


xy = |x|ly|cos 0 (3) 
where @ is the angle between x and y and 
Ix] = (xp + xg) or (xt + x9 + x3) 


in R* and R°, respectively, is the Euclidean length of x. Rewriting Equation 
(3) in the form 
g=—* (4) 
Ixlly| 
enables us to compute the angle @ between two vectors. Finally, if we observe 
that 


[x| = (xx)? (5) 


then both the length of a vector and the angle between two vectors can be 
computed from the dot product alone. 

We are now ready to generalize this concept of dot product. This general- 
ization will be called an inner product. We will first define a real inner product, 
and then toward the end of this section, a complex inner product. 


Definition. Let V be a real vector space. A real inner product on V is a 
real-valued function that associates with each pair of vectors x, y a real 
number, denoted by <x,y>, that satisfies the following properties: 


(i) <x,y> = <y, x> for all x and y in V, 

(11) <kx,y> = k<x,y> for all scalars k and vectors x, y, 
(i11) <x + y,Z> = <x,z> + <y,z> for all x, y, z in V, 
(iv) <x,x> > 0 and <x,x> = 0 only ifx = 0. 


It is customary to refer to the vector space V, together with some inner 
product <¢ , > as a real inner product space, and a finite-dimensional real 
inner product space is often called a Euclidean space. 
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Remark. Observe from (i) that properties (ii) and (iti) hold equally well on 
opposite sides of the inner product bracket, 1.e., 


<x,ky) =k<x,y> and <x,y +z) = <x y> + (xz). 
The classic example of an inner product on R", of course, is the dot product 
CX YD = XY = Xp + XqV2 to + XpYp- 


It is easily verified that the dot product satisfies properties (i)—(iv). Here 
are some additional examples, which have proven extremely useful in 
applications. 


Example 1. Let V be the space of all continuous functions on the interval 
[a,b], and define the inner product of two functions f(x) and g(x) in V as 


{fg = | Flxg(x) dx. (6) 


It is easily verified that the definition (6) satisfies properties (1)—(iv). 
Example 2. Let V be as in Example 1, and define 


{fg = | r(x) f(xo)g(x) dx (7) 


where r(x) is positive on the open interval (a, b). Again, it is trivial to verify 
that the inner product (7) satisfies properties (i)—(iv). 


Example 3. Let V = R° and set 


CX,Y> = X1Vy + 2X22 + 2X33 — Xp V2 — X2V1 — X2V3 — X3)2- 


Properties (11) and (111) are trivial to check, and property (1) is simple to verify. 
To check property (iv), we write 


Xx,X> = x7 + 2x2 + 2x3 — x1 xX. — XX, — XQX3— X3X> 
= x? + 2x2 + 2x32 — 2x,x. — 2x2x3 
= x? — 2x,x, + x3 + x3 — 2x,x34+x3 + x3 
= (x1 — Xz)? + (x2 — x3)? + x3, 


and it is now clear that <x,x> = 0 if and only if x = 0. 

Let x and y be vectors in an arbitrary inner product space V, and suppose, 
following Equation (5), that we define the lengths of x and y, denoted by ||x|| 
and |ly||, respectively, as 


Ixl =x", llyll=<yy’, (8) 


where the double vertical lines are used to denote length. It is certainly very 
plausible to try to mimic Equation (4) and define the angle 0 between x and 
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y via the equation 


g- 6 ___ &Y_ 
Ixilyll <%x>"?<y,yp"? 


Clearly, the right-hand side of Equation (9) must be <1 in order for this 
equation to have any meaning at all. A second and more subtle requirement 
is that the quantity ||x||, defined as the length of x, really is a length; that is, it 
satisfies the geometric properties usually associated with length. Let us there- 
fore take one last digression and discuss the concept of length of a vector, or 
as it is traditionally called in linear algebra, the norm of a vector. 


(9) 


Definition. A norm on a real or complex vector space V is a real-valued 
function, usually denoted by || ||, which satisfies 
(i) ||x|| > O for all x in V and ||x|| = 0 only if x = 0, 
(11) ||kKx|| = |k| ||x|| for all scalars k, 
(ili) |x + yl] < [xl] + llyl. 


Properties (i) and (ii) are fairly obvious properties of length, while property 
(iii) is simply the triangle law, which states that the length of one side of a 
triangle is always less than (or equal to) the sum of the lengths of the other 
two sides (see Figure 1). 

The classical example of a norm is ordinary length in R"; e., if 


xy 
Xn 
then 


xl] = Get + x9 +o + xR)? 


obviously satisfies (1)—(i1i). Here are some additional examples, which have 
proven quite useful in applications. 


Figure 1 
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Example 4. Let V = R” and define 
xa = [xq] + [x2] + °° + [xl 


It is extremely easy to verify that || ||, is anorm on R". 


Example 5. Let V = R" and define 
XI = max[|X,|,|X2|,...,1X,lJ. 


Again, it is extremely simple to verify that || ||,, defines a norm on R”. 


Example 6. Let V = R” and define 
IXILp = Chea? + [x2l? + °° + |xylP]*”. (10) 


It is extremely simple to verify that || ||,, referred to as the p norm, satisfies 
properties (i) and (ii). Property (111) is also true but quite difficult to verify, 
hence we will not do so here. We wish to point out (see Exercise 6), however, 
that 


|x|]. = lim |x|, 
p?o 


and this is the motivation for the strange notation || ||,,. 


Example 7. Let V be the space of all continuous functions f(x) on the interval 
[0, 1], and define 


1 
ifl= | /Goldx (11) 
It is easily verified that Equation (11) defines a norm on V. 


Let us return now to the questions raised via the definitions (8) and (9). As 
mentioned previously, we must show that the right-hand side of Equation (9) 
is always between —1 and +1 in order for this equation to make sense. But 
this is equivalent to the inequality 


Kx yD] < <x, x)? <y,y"? 


or, equivalently, 


Cx, y>| < [x] lly]. (12) 


Equation (12) is the precise statement of a very famous theorem, known as 
Schwarz’s Inequality, which we now prove. 


Theorem 1 (Schwarz’s Inequality). Let V be a real inner product space with 
inner product < , >. Then 


Kx, y>| < | xl lly (13) 
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for all vectors x, y, where ||x|| is defined by Equation (8). Furthermore, 
equality holds in Equation (13) only if the vectors x and y are linearly 
dependent, i.e., y = kx for some scalar k. 


ProoF. There are several ways of proving Theorem 1, but we will choose here 
the proof that generalizes most easily to the complex case. Observe that 


_ &Yy -Sy) 0 14 
(3 yyy Gy / 7 U4) 


for all nonzero vectors x and y in V. This follows immediately from property 
(iv) of inner products. Next, using properties (i)—(i11) of inner products, we can 
rewrite Equation (14) in the form 


<x, y> <x, y> «x, y>? 
<x, x» ~ <y, X> ~ x, » + <y, y> 
“yy) Cy? Sy? 
2 2 
= (xx) — 20692" 4 SEY? 
yyy  <yY 
<x, y>? 
= {x,x> — 
6%? <y, yD 
<x, y>* 
= ||x|/7 — , 
|x| iyi 
Hence, 
<x, y>? ; 
< ||x 
iyi? = '" 
or, equivalently, 
<x, y>] < [x] lly! 
Finally, again from property (iv), equality holds in Equation (14) only if 
x BY y 8 
cy, yD 
or 
<x, yD 
x = ky, k= ; 
, <y, yD 


Remark 1. The proof we just gave assumes that y 4 0. We leave it to the 
reader to verify (see Exercise 13) Schwarz’s Inequality in the trivial case that 
y = 0. 


Remark 2. The vector (<x, y>/<y, y>)y introduced in Equation (14) was not 
a luckly guess. This vector is actually the projection of the vector x onto the 
vector y, as shown in Figure 2. 
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<X,Y> y 
<Y,Y> 


Figure 2 


The second question raised was whether 
xl] = <x,x>*”? (15) 


satisfies properties (i)—(iii) of norms. Property (1) follows immediately from 
property (iv) of inner products. To verify property (11), we compute 


kx] = <kx, kx"? = [k*<x, x]? = |[k] <x, x>1/? = [A] [Ix]. 
Property (111) is a bit more difficult to verify. Observe first that 
Ix + yl? =<x+y,x+y> 
= &, x) + GY) + (YX) + (YY? 
= ||x||? + 2<x,y> + llyll? 
< ||x|]* + 2])xil yl] + lly? 
= [|x|] + llyll]*, 
where the inequality above follows directly from Schwarz’s Inequality. Tak- 


ing square roots gives the triangle inequality, and this completes our proof. 


Remark. Schwarz’s Inequality is an extremely powerful tool, as witnessed by 
the preceding proof. Here is another illustration of its strength. Let V be the 
space of all continuous functions on the interval [a,b], and define 


{f,g) = | Sloo)g(x) dx. 


Then, from Schwarz’s Inequality, 


KKA9?| = < [FCI lg) I. 


b 1/2 b 1/2 
<| | Pax | Peax| | (16) 


We defy the reader to give a pure calculus proof of the inequality (16). 


| Slx)g(x) dx 


Hence, 


| Slx)g(x) dx 
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Once we have an inner product space V, we can define two vectors in V to 
be orthogonal, or perpendicular to each other, if their inner product is zero; 


1.e., x and y are orthogonal if <x, y> = 0. 


Example 8. Let V be the space of all continuous functions on the interval 


[ —z,7], and define an inner product on V via the relation 


(fray =| flxdge) dx 
The set of functions 
f, (x) = sin kx, k=1,2,..., 
are all mutually orthogonal; that is, 


CFO) AO)? =0, J #K, 


since 


<0.) = | sin jx sin kx dx = 0, jtk 


following the discussion in Section 5.4. 


Example 9. Let V be the space of all continuous functions on the interval 


[—1,1], and define 
F(x)g() 
1./1— x? 


Cf 9> = [ dx 


for f and g in V. The set of functions 
fy. (x) = cos(k cos! x), k =0,1,2,..., 
are mutually orthogonal with respect to ¢ , >, ie., 


[ cos(jcos~* x) cos(k cos” *x) 5 


-1 /1 — x? 

To establish Equation (17) we make the substitution 
u = cos” !x du = a 

/1—x? 


Then the left-hand side of Equation (17) becomes 


=0, j#k. 


T 


0 
-| cos jucos ku du = | cos jucos ku du 


tT 9) 


(17) 


1 +9 
=5| [cos(j + k)u + cos(j — k)u] du = 0, jH#k. 
OQ 
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Remark. It can be shown that the functions f,(x) defined in Example 9 are 
the Tchebycheff polynomials T,(x). 


We conclude this section by extending the notion and definition of inner 
product to complex vector spaces. The main problem we have to overcome 
is the requirement that <x,x)> be positive for all nonzero vectors x. For 
example, if V is the space of all complex-valued functions 


ho=fO+ig, axt<b, 


then we cannot define 


<hysha) = | "hy (Ohg(t) de 


as in Example 1, since 


b 
<h,h) = | Lf(t) + ig IL + ig(t)] at 


is not even real [for < f,g> 4 0], let alone positive. 
To motivate the proper extension, we return to the familiar dot product 


CX,Y> = XY =XpVy + X2QV. + + XpVy (18) 


where x and y are real vectors, 1.e., their components are real numbers. We 
can extend this definition to complex vector spaces by making the simple 
adjustment 


XY = XyVy + XQV_ to + Xp (19) 
Then, following Equation (19), 
<x, X> = X1X1 + X2X> +e + XnXn 


is always real and nonnegative. The only thing we have to be extra careful 
about now is that 


YX = YiXy + + VyXn = GY? 
instead of <x, y> and 
x-ky =x, ky, +-°° + x,ky, = k<x, yd 
instead of k <x, y>. This leads us to the following definition. 


Definition. Let V be a complex vector space. A complex, or Hermitian, inner 
product on V is a complex-valued function that associates with each pair 
of vectors x, y a complex number, denoted by <x,y>, that satisfies the 
following properties: 


(i) <x,y> = <y, xX), 
(li) <kx,y> = k<x, y>, 
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(ill) <x + y,Z> = ¢X,Z) + <y,Z), 
(iv) <x,x> > 0 and <x,x> = 0 only ifx = 0 


for all vectors x and y and complex numbers k. 


Property (i) is known as conjugate symmetry in contrast to the ordinary 
symmetry of the real case. It implies that 


(x, ky> = <ky,x> = k<y,x> = k<y, x) = kx yD. 


Thus, scalars pulled out from the right-hand side of a complex inner product 
must be conjugated. Finally, we leave it to the reader to verify (see Exercise 
14) that 


XY + Z) = Gy) + (XZ). 


Example 10. Let V be the space of all complex-valued functions h(t) = f(t) + 
ig(t) on the interval [0, 1], and define 


<h,,hy) = [. h,(t)ha(@ dt 
= \, [f.(t) + ig, DILAO — ig,(t)] at (20) 


- | Chili falt) + 9s g2(0)) dt + i \. [falt)ai() — frDga(t)1 at 


It is easily verified that Equation (20) satisfies properties (i)—-(iv) of our 
definition. 

Complex inner product spaces are the same as real inner product spaces, 
essentially, except for the conjugate symmetry property, as noted above. For 
example, we define 


xl] = (x, xp"? (21) 


to be the length of a vector x in a complex vector space, just as in a real vec- 
tor space. The only thing that we must do, though, is extend the proof of 
Schwarz’s Inequality to the complex case, since our proof made use of the 
symmetry property of real inner products. 


Theorem 2 (Schwarz’s Inequality). Let V be a complex inner product space 
with inner product < , >. Then 


Kx v1 < [xi lly (22) 


for all x, y in V, where ||x|| is defined by Equation (21). Furthermore, equality 
holds in Equation (22) if and only if the vectors x and y are linearly depen- 
dent. 
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Proor. Observe from property (iv) that 


BYP &Y? 
_ 0 23 
(3 yy? wy” y) °°) 


for all vectors x and nonzero vectors y in V. Using properties (i)—(i1i), we can 
rewrite the left-hand side of Equation (23) in the form 


«xX yD _ &Y? XY) <% YD 
Yyyy PP — Gy OY + Gy Gy 


_ _&Yz—~ _ &Y? <X% YP 5 
= OG) — TS HY? — yyy OY? + 


Kxy>P 
ly 

_ [xIPllyl? — Kx y>P? 

yl 


Cx, x) _ 


<y,y> 


«x, y> 


= |x|]? — 


Hence, 


Kx, yl? < IIxll*llyll?, (24) 


and Schwarz’s Inequality follows immediately by taking square roots in 
Equation (24). 


Remark. As in the real case, our proof assumes that y 4 0. We leave it to the 
reader (see Exercise 13) to complete the proof for y = 0 and to verify that 
equality holds if and only if x and y are linearly dependent. 


EXERCISES 


1. Show that Equation (6) defines an inner product on C[a,b], the space of all 
continuous functions on the interval [a, b]. 


. Show that Equation (7) defines an inner product on C[a, b]. 
. Show that ||x||, defined in Example 4 is a norm. 
. Show that ||x||,, defined in Example 5 is a norm. 


. Show that ||x||, defined in Example 6 is a norm. 


NAN a &» WW N 


. Show that ||x||,, = lim ||x|l,. 


po 
7. Show that ||x|| = |x,| + 2]x,| + 4|x3| is a norm on R®. 
8. Show that |x,] + |x.| + |x3|* does not define a norm on R?. 
9. Show that e*:!*2! does not define a norm on R?. 


10. Show that Equation (11) defines a norm on C[a, b]. 
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11. Let V be a real inner product space. Prove that 


1 1 
_- 2 Hy — wil2 
OGY) = ail + yl — 7 lx — yl. 


12. Let x x y be the vector with components 


X2V3 — X3 V2, X3Y1 — X1 V3, X%1 V2 — X2)1> 


respectively. Show that x x y is perpendicular to both x and y under the inner 
product (x, y). 


13. Complete the proof of Schwarz’s Inequality in both R" and C" for the case y = 0. 
14. Show that 
(YY + Z) = Gy) + (%D 


for all vectors x, y, zin C”. 


6.3 Orthogonal bases, Hermitian operators 


It is often the case that a crucial step in the solution of many problems in a 
linear vector space revolves around a judicious choice of basis. A type of basis 
that is especially useful is one that is orthogonal. 


Definition. A set of vectors is said to be orthogonal if the inner product of any 
two distinct vectors in the set is zero. 


One of the nice things about an orthogonal set of vectors is that they are 
automatically linearly independent. This is the content of Lemma 1. 


Lemma 1. Let x,, x,,...,Xy be mutually orthogonal, that is, 
{X,, x;> = 0, l fF j. 


Then, x,, X2,...,Xy are linearly independent. 


Proor. Suppose that 
CyXy + C2X_ +577 + CyXy = O. (1) 
Taking inner products of both sides of Equation (1) with x; gives 
C1 (X1, Xj) + C2 (XQ, Xj) + 77° + Cy<Xy, X)> = 0. (2) 


The left-hand side of Equation (2) reduces to c;{x,,x;)>. Since ¢x;,x;> > 0, we 
see that c; = 0, j = 1, 2,..., N, which proves Lemma 1. 

Another benefit of working with orthogonal bases is that it is relatively 
easy to find the coordinates of a vector with respect to a given orthogonal 
basis. To wit, let u,,u,,...,u,, be a mutually orthogonal set of vectors in a real 
n-dimensional vector space V. By Lemma 1, this set of vectors is also a basis 
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for V, and every vector x in V can be expanded in the form 
X= C,U, +C,U, +°''+C,U,. (3) 
Taking inner products of both sides of Equation (3) with u, gives <x,u;> = 
c;<u,,u;> so that 
_ <x, U,> 


= , p= 1,2,...,n. 4 
I~ Guy?! " @ 


Example 1. Let V = R* and define <x, y> = x,y, + X2y2. The vectors 


v-() ot e() 


are orthogonal and, thus, form a basis for R*. Consequently, from Equations 


(3) and (4), any vector x = (*:) can also be written in the form 
2 


x= “ay (6 + C 
Ax) OTM *\_1 
_ Xx, U; > ()) 4 <x, u,> ( ) 
<u,,U,> 1 <u, U, > —1 
_ x4 + X42 1 4 X41 “_ X4 1 
2 1 2 —1})" 
Given the benefit of working with orthogonal bases, the following question 
naturally arises: Does every n-dimensional Euclidean space have an ortho- 


gonal basis? The answer to this question is a resounding yes and is given by 
the following theorem. 


Theorem 3 (Gram-Schmidt). Every n-dimensional Euclidean space V has an 
orthogonal basis. 


Proor. Choose a basis u,, u,,...,u, for V. We will inductively construct an 
orthogonal basis v,, v,,...,v, by taking suitable combinations of the vectors 
u,,---,U,- Lo this end, let v; = u, and set 


V> = u, + AV. (5) 
Taking the inner product of v, with v, gives 
(V2, Vi > = (Uy, Vy) + ACV, V4 D 


so that v, will be orthogonal to v, if A = —<u,,v,>/<v,, Vv, >. Note that v, 
cannot equal 0 by virtue of the linear independence of the vectors u,,...,u,,. 
Proceeding inductively, let us assume that v,, v,,...,V, are mutually ortho- 
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gonal, and set 
Viett = Ucar AV, $05 + AGM . (6) 
The requirement that v,,, be orthogonal to v,,...,v, yields 
0= CVi419 Vj> = (U4 V)> + AX; Vp» 


which gives the desired relation 


es Vp? Py . 
A; — = 1, oee 5 k. 
J vv) vi” J 
With this choice of 4,,..., A,, the vectors v,,...,V,, ¥,4, are mutually ortho- 


gonal. Moreover, v,,, cannot be 0 by virtue of the linear independence of 
U,,--->U,4,- Proceeding inductively until k = n, we obtain n mutually ortho- 
gonal nonzero vectors V,,...,¥ 


n° 


Remark 1. The procedure outlined in Theorem 3 is usually referred to as the 
Gram-Schmidt orthogonalization procedure. 


Remark 2. Ifu,,...,u,, 1S an orthogonal set of vectors, and, in addition, each 
of the vectors u, has length 1, i.e., 


ju;|| = <u,,u uj’? = 1, j=1,...,n, 


then u,,...,u, 1s called an orthonormal set of vectors. It is a simple matter to 
construct an orthonormal basis from an orthogonal one: if u,,...,u, is an 
orthogonal basis, then 

Uy _ U, U, 


v,= 


2 — eee 
lu, {| ju, || |, || 


are orthonormal, since 


u. 
lvl = | — 
lu, | 


Example 2. Let V be the space of all polynomials of degree n — 1, and define 


i= 


jg Ml =! 


<fa>=[ flodatx)ax 


for all functions f and g in V. It is easily verified that V is an n-dimensional 
Euclidean space and that 


fo(x) = I, f(x) =x, ees fa—1(x) = x" 


form a basis for V. Applying the Gram-Schmidt orthogonalization procedure 
to the functions fo(x),...,f,-1(x) gives 


Po(x) = 1; 
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<fi»Po> _ 
<Po»Po> [. 1 dx 


p(x) =x? +A,x +A, 


Di(ix)=xtA=H=x- 


where 
1 
| x? +x dx | x? dx 
-1 -1 I 
A> = — i = 0, Ay = 1 = 3 
| x? dx | 1 dx 
~| -1 
so that 
1 
= 2 _— — 
P2(x) x 3 ) 
3 2! 
p3(x) =x +A; > ¢ 3 +A,x+A, 
with 
1 1 
-| x3(x? — 4)dx -| x3-xdx 
43 = ese 8 = 
| (x? — 4)? dx | x? dx 
—1 -1 
and 
1 
-| x3 dx 
A= =0 
| 1 dx 
-1 
so that 
3 
p3(x) = x? — 5. 
These polynomials po(x), p;(x), p2(x), p3(x),..., when properly normalized, 


1e., when multiplied by a suitable constant, are the Legendre Polynomials 
P,(x) discussed in Chapter 2. 

We now digress for a moment to discuss the following seemingly totally 
unrelated problem. Let V = C” with inner product 


CX, Y> = (X,Y) =X V1 + X22 tooo + Xan 


where X,,...,X, and y,,..., y, are the coordinates of x and y, respectively, and 
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let A be a given n x n matrix. Does there exist a matrix B such that 


(Ax, y) = (x, By) (7) 


for all vectors x and y? 
To answer this question, we let A = (a,,), B = (b,;) and observe that 


(Ax, y) = y (s aux) = 


j=l 


t,J 


n 
DEX; 
~ 


and 


(x, By) = 3 03 bus) = y bj ;Xi¥;.- 
i=1 jJ=1 i,j=1 
Hence, Equation (7) is satisfied if and only if b,, = a,, or, equivalently, 
b,, = Gj. (8) 
Definition. The matrix B whose elements are given by Equation (8) is called 


the adjoint of A and is denoted by A*. If A* = A, then A is said to be 
selfadjoint or Hermitian, and 


(Ax, y) = (x, Ay) for all x and y. 


Remark 1. If A is real, then A* is simply A’, the transpose of A. Thus, if A is 
real and symmetric, i.e., A = A', then A is Hermitian. 


Remark 2. It is a simple matter to show (see Exercise 6) that (A*)* = A and 
that 


(x, Ay) = (A*x,y) for all x and y. 


Example 3. Let 


A= 3 —i 1-i 
i 2i 31 
Then, 
1-i 3 —j 
A* = 2 i —2i 
—i il+i —3i 
Example 4. Let 
3 4 
A= |13 4 5 
4 5 6 
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Then A* = A’ = A. 
Selfadjoint matrices have several very important properties, which we now 
wish to describe. 


Property 1. Let A be selfadjoint and let x be an eigenvector of A with 
eigenvalue J. Since A is selfadjoint, (Ax, x) = (x, Ax), but 


(Ax, x) = (Ax, x) = A(x,x), 
while 

(x, Ax) = (x, Ax) = A(x, x). 
Hence, A = /. 


Corollary. The eigenvalues of a real symmetric matrix are real. 


Property 2. Let A be selfadjoint, and let u, and u, be eigenvectors of A with 
eigenvalues 1, and /,, respectively, with 1, #A,. Then, u, and u, are 
orthogonal, 1.e., (u,,u,) = 0. 


Proor. Observe first that 
(Au,,u,) = (A,u,,U,) = A, (uy, Uy). 
Since A is selfadjoint, 
(Au,,u,) = (u,, Au,) = (u,,/,u,) = A,(u,, u,) 
since A, is real. Hence, 


A,(u,,U2) = A2(U;, Uy), (9) 


and since 4, # A,, Equation (9) now forces (u,,u,) = 0, which proves Prop- 
erty 2. 

In actuality, we can state a much stronger property than Property 2: Not 
only are the eigenvectors belonging to distinct eigenvalues of a selfadjoint 
matrix orthogonal, but every n x n selfadjoint matrix posesses a full comple- 
ment of n mutually orthogonal eigenvectors. This is the content of Property 3. 


Property 3. Every n x n selfadjoint matrix A has n mutually orthogonal 
eigenvectors. 


Proor. Every matrix has at least one eigenvector. Let v, be an eigenvector of 
A with eigenvalue 4,, and let W be the subspace of V, which consists of all 
vectors in V that are orthogonal to v,. We leave it to the reader to verify (see 
Exercise 7) that W is a linear subspace of V of dimension n — 1. The crucial 
observation now is that A takes W into itself; i.e., if x is orthogonal to v,, then 
so is Ax. But this follows immediately from the observation that 


(Ax, V,) = (x, A*v,) = (x, AV,) = (x, A, V1) = A, (X, V;). 
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Thus, (x, v,) = 0 => (Ax,v,) = 0. 
The next part of our proof is a little tricky. Recall from Section 3.7 that 
every n x n matrix A induces a linear transformation @ on C"(R") with 


@ (x) = Ax. (10) 


Conversely, every linear transformation @ on C"(R") can be written in the 
form (10) for some appropriate matrix A. Let @ be the linear transformation 
induced by A*. From the remarks above, @ takes W into itself. Thus, assum- 
ing that (see Exercise 8) every linear transformation that takes a vector space 
into itself has at least one eigenvector, we see that @ has at least one eigen- 
vector v, in W3 Le., 


@(v,) = Av, = A,V, 


for some number /,. Proceeding inductively, that is, by considering the 
subspace of W perpendicular to v,, we can produce n eigenvectors V,, V,..., 
v, Of A that are mutually orthogonal, and this completes the proof of 
Property 3. 


Corollary. Every symmetric n x n matrix A has n mutually orthogonal eigen- 
vectors. 


At the beginning of this section we mentioned that orthogonal bases were 
often very useful in applications. In Chapter 3, we observed that a basis that 
consists of eigenvectors of a matrix A is also very useful since in this basis the 
matrix is diagonal. The ideal situation in practice is a basis of eigenvectors 
of a selfadjoint matrix A, for then we have the best of both worlds, i.e., 
the eigenvectors form an orthogonal basis. This is the real secret behind the 
famous Sturm-Liouville Theory to which we now turn our attention. 


EXERCISES 


‘1. Let V be the space of all polynomials of degree < 2 on the interval [—2,2], with 
inner product 


(fay= | x*fbog(wae, 


Starting with the basis 1, x, and x*, use the Gram-Schmidt procedure to construct 
an orthonormal basis for V. 


2. Let V be the space of all polynomials of degree < n, on the interval [a, b], and define 


b 
(fad =| roo flxg(x) dx. 
Use the Gram-Schmidt procedure to obtain an orthogonal basis for V in the case 
(a) (a,b) = (—1, }), r(x) = |x|"; 
(b) (a,b) = (—oo, 00), r(x) =e. 
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3. Obtain the first four Hermite polynomials by applying the Gram-Schmidt proce- 
dure to the monomials 1, x, x”, and x°, with weight function r(x) = e~*7”. 


4. Let V be the n-dimensional space spanned by the n functions e~**, k = 0, 1,..., 
n — 1, and let 


(f= | segce)ax 
for f, g in V. Obtain an orthogonal basis for V. 


5. Obtain the first four Laguerre polynomials by applying the Gram-Schmidt proce- 
dure to the monomials 1, x, x, and x°, with weight function r(x) = e~*. 


6. Show that (A*)* = A. 


7. Show that the subspace W introduced in the proof of Property 3 has dimension 
n—1. 


8. Show that every linear transformation that takes a vector space into itself has at 
least one eigenvector. 


6.4 Sturm-Liouville theory 


In Chapter 5 we studied equations of the form 


LL y](x) = y"(x) + Ay(x) = 0 (1) 
subject to the boundary conditions 
a, y(O) + byy'(0)=9, = an y(I) + Bn y'(I) = 0. (1’) 


Equation (1) together with the boundary conditions (1’) was referred to as a 
boundary value problem. We then stated a theorem to the effect that this 
boundary value problem has a nontrivial solution only for a special set of 
numbers 


hy <n <h3y<c<d<ce 


called eigenvalues and every differentiable function y(x) can be expanded in a 
series of these eigenfunctions, i.e., 


y(x) = Yo cy (x), = O<x< 
n=1 


For example, the boundary value problem 
y"(x) + Ay(x) =0, = y(0) = O, y() = 9, (2) 
has eigenvalues 1,, = n?x7/I?, with corresponding eigenfunctions 


ATX 
“7 9 


y, (x) = sin 


n= 1, 2,..., 
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and every differentiable function f(x) can be expanded in a Fourier series of 
the form 
~ . nmx 
yxy= > c, SiN, O<x<l. 
n=1 


The theorem we just stated is a special case of a much more general theorem, 
which applies to the more general equation 


LL y] (x) = a(x)y"(x) + b(x)y'(x) + c(x) v(x) = —Ar(x) yx) (3) 
and the boundary conditions 
a, y(a) + bya) =90, an y(B) + br y'"(B) = 0. (3’) 


We can gain a lot of insight into this problem by studying it from a linear 
algebra point of view. To wit, let V be the space of all twice continuously 
differentiable functions. V is a linear vector space but is infinite dimensional, 
rather than finite dimensional. Next, define an inner product on V via the 
relation 


B 
(f.g) = | Flxg(x) dx. (4) 
The operator L defined by 
LLy] (x) = alx)y"(x) + b(xDy'00) + e(x)y(x) (5) 


is a linear operator defined on V (see Section 2.1) and it is natural to ask 
whether L is selfadjoint in the sense that 


(Lu, v) = (u, Lv) (6) 


for all functions u, v in V. The answer to this question is given in Theorem 4. 


Theorem 4. Let V be the space of all twice continuously differentiable functions 
on the interval [a,B]. Then, the operator L defined by Equation (5) is 
selfadjoint on V, in the sense of Equation (6), if and only if 


b(x) = a’(x). (7) 
Proor. Computing 
(Lu, v) = [- [a(x)u"(x) + b(x)u'(x) + c(x)u(x)] v(x) dx 
and 
(u Le) = [* Cala)o"a) + BOo(x) + ela) ue) dx 
we see that 
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(Lu, v) — (u, Lv) = [- [a(x)v(x)u" (x) + b(x)v(x)u’(x)] dx 
— [- [a(x)u(x)v” (x) + b(x)u(x)v'(x)] dx. 
Integrating the terms containing u”(x) and v’(x) by parts gives 
B B 
(Lu, v) — (u, Lv) = -| a(x)u’(x)v' (x) dx -| a’'(x)v(x)u'(x) dx 
B B 
+ | b(x)v(x)u'(x) dx + | a(x)u'(x)v'(x) dx 
B B 
+ | a'(x)u(x)v’(x) dx — | b(x)u(x)v’ (x) dx 
+ a(x)v(x)u’(x)|f — a(x)u(x)o'(x)|2 


B 
= | [oC ~ a’) LoG)u"(x) — uCxJ0'(e)] de 


+ a(x) [v(x)u’(x) — u(x)v'(x)]2- 


(8) 


The boundary terms in Equation (8) are zero. To see this, assume that the 
constants b,, b, in Equation (3’) are not zero (the proof is trivial if they are— 
see Exercise 1), and set c, = a,/b,, c. = a,/b,. Then 


a(x) [v(x)u’(x) — u(x)v'(x)] 8 
= a(B)[v(B)u'(B) — u(B)v'(B)] — ala) Lo(a)u’(a) — u(a)v’(a)]; (9) 
but 
u'(B) =czu(B), v'(B)=czv(B), —u"(a@) = cy u(x), v'(a) = cy o(a) 


since u(x) and v(x) satisfy the boundary conditions (3’). Hence, the right-hand 
side of Equation (9) becomes 


a(B)[v(B)czu(B) — u(B)c2v(B)] — a(a)[v(a)c, u(a) — u(a)c,v(a)] = 0. 


Thus, we have shown that 


B 
(Lu, v) — (u, Lv) = | [b(x) — a’(x)] [v(x)u’(x) — u(x)v'(x)] dx (10) 


for all u and vin V. The right-hand side of Equation (10) is zero if b(x) = a’(x). 
Thus, L is certainly selfadjoint in the sense of Equation (6) if b(x) = a’(x). In 
addition, it is intuitively clear that b(x) = a’(x) is also a necessary condition 
for L to be selfadjoint; the right-hand side of Equation (10) cannot vanish for 
all u and v in V if b(x) £.a’(x). A proof of this assertion is outlined in the 
exercises. 
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In summary, the boundary value problem (3), (3’) is self-adjoint if and only 


if b(x) = a’(x). Setting a(x) = p(x) and c(x) = — q(x), we can now write all 
selfadjoint boundary value problems in the canonical form 
LU y](x) = Lp()yQ)Y — ax) y(x) = — Ar(x) yx) (11) 
a, y(a) + by'(a) =90, = an y(B) + bn y’(B) = 9. (11°) 


Equation (11) together with the boundary conditions (11’) is often referred to 
as a Sturm-Liouville boundary value problem. 


Remark 1. In the notation of Equation (11), Equation (8) becomes 
(Lu, v) — (u, Lv) = — p(x)[u'(x)v(x) — u(x)o’(x)]2- (12) 


Equation (12) is known as Lagrange’s identity, and if u and v satisfy the 
boundary conditions (11’), then Lagrange’s identity reduces to 


(Lu, v) — (u, Lv) = 0. (13) 


Remark 2. We have derived Equation (13) in the case that u and v are both 
real, because that is how Lagrange’s identity is usually expressed. It is a simple 
matter, however, to verify (see Exercise 3) that Equation (13) is also true for 
u and v complex, provided we define 


(fa) = | " f00)g00 dx. (14) 


Definition. A Sturm-Liouville boundary value problem is said to be regular 
if each of the following conditions hold: 
(i) r(x) > O and p(x) > 0 for x € [a, fp]. 
(11) p(x), p’(x), q(x), and r(x) are continuous on [a, f]. 
(111) (a, 6) 1s finite. 


We are now ready to state and prove the main theorem of this chapter. 


Theorem 5. For any regular Sturm-Liouville boundary value problem: 
(1) All the eigenvalues (and consequently all the eigenfunctions) are real. 
(2) Eigenfunctions belonging to different eigenvalues are orthogonal under 
the inner product 


B 
if,g> = | r(x) f0g(x) dx. (15) 


(3) To each eigenvalue, there corresponds one, and only one, eigenfunc- 
tion. 

(4) There are a countably infinite number of eigenvalues Ao, 4,, 42, ... with 
corresponding eigenfunctions yo(x), y,(x), y2(x),.... The eigenvalues 4, can 
be ordered so that n refers to the number of zeros of y, in the interval [a, B]. 
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In addition, the zeros of y,+, interlace those of y,. Finally, 49 < 4, < 
dy <A3°°: with A, > 00 asn— oO. 

(5) Let f(x) be any continuously differentiable function on the interval 
[o, 8]. Then, f(x) can be expanded in a convergent series of the eigenfunc- 
tions of L; i.e., 


Fl) = Y anya(o) 


where 


B 
| r(x) flo) y,(oo) dx 
(16) 


a, = 


{ " r(x)y20)dx 


a 


ProoF. (1) The proof of (1) follows immediately from the selfadjointness of L: 
if Ly = —Ay then 


B 
(Ly, y) =(—Ary,y) = —4 | r(x)ly2(0)| dx (17) 


a 


and 


_ ¢B 
(95Ly) = (Ary) = -7 | reoly*ol dx (18) 
since r(x) is real. Equating Equations (17) and (18), we see that 4 must be real. 
(2) Let y,(x) and y,,(x) be eigenfunctions of L with eigenvalues A, and J,,, 
respectively, with A, # 4,,. Then 
(Ly,,, 9) — (—AnTYns Yn) 
B 
= — Ay | re) 6) Yu ds 
however, since L is selfadjoint, 
(LY ns Ym) = (Vas LYm) 
(y no Am ry m) 


B 
dy | +(x) Yn(%) Ym) dx, 


and since A, # /,,, we see that 


B 
| 1(X) Vp) Vg) dx = CV qs Vm = 0. 


(3) Suppose that u,(x) and u,(x) are two independent eigenfunctions 
with eigenvalue 4. Then both u, and wu, satisfy the second-order differential 
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equation 
(p(x)y') — a(x)y = —Ar(x)y (19) 
in addition to the boundary conditions 
a, y(a) + byy"(a)=0, = an y(B) + ba y(B) = 9. (20) 


Since u, and u, are independent, they form a basis for all the solutions of 
Equation (19), that is, every solution y(x) of Equation (19) can be written in 
the form 


y(x) = C14, (x) + C2U2(x) (21) 


for some choice of constants c,, c,. Finally, observe that if u,(x) and u,(x) 
satisfy the boundary conditions (20), then so does the linear combination (21). 
Hence, every solution of Equation (19) must satisfy the boundary conditions 
(20), which is clearly absurd. Thus, to each eigenvalue 4 of L, there cor- 
responds only one eigenfunction. 

(4) The proof of (4), though not very difficult, is quite long, so it will be 
omitted here. 

(5) The convergence part of (5) is too difficult to present here. We wish to 
point out though, that if f(x) can be expanded in a series of the eigenfunctions 


y,(X), 1.€., 
F0) = Y ayn(29 (22) 


then a, must, of necessity, be given by Equation (16). To see this, assume that 
the series (22) is convergent and that the inner product of f with any function 
y(x) can be distributed, that is, 


CAD = Yay Yard? (23) 
If y(x) = y,(x), then Equation (23) reduces to 
cf y;> = A;< Vj, Vj> (24) 


by virtue of the orthogonality of the eigenfunctions y (x), y, (x), y2(x),..., and 
it now follows immediately from Equation (24) that 


B 
fy _ |, Penfeaneras 
Yin Vi ) r(x) y2(x) dx 
Part (5) of Theorem 5 deals with pointwise convergence, i.e., the series (22) 
converges for each x in (a, B) if f(x) is continuously differentiable. A different 
type of convergence, which has proven quite useful in applications, is known 
as convergence in the mean. 
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Definition. A sequence of functions f(x), « < x < B, is said to converge in 
the mean to f(x) if 


B 1/2 
If — fll = [rere — 400? ax 
approaches zero as n > 0. 


Part (5) of Theorem 5 now has the following analogue in terms of mean 
convergence. 


Theorem 6. Let 


fA) = Yay (o) (26) 


where a; is given by Equation (25). Then, f,(x) converges to f(x) in the mean 
for all functions f(x) with 


| " W(x) f2(x) dx < 00, 


a 


i.e., for all functions f(x) with finite norm. 


Example 1. Consider the Sturm-Liouville boundary value problem 


Lly}=y"=—Ay, — y'(0) = 0, y'(1) = 0. (27) 
Here p(x) = r(x) = 1 and q(x) = 0. It is easily verified that 


y = y,(x) = cos nnx, n= 0, 1, 2,..., 


are the nontrivial solutions, i.e., eigenfunctions, of Equation (27), with cor- 
responding eigenvalues 


A, = n°n?, n=0,1,2,.... 


Parts (1)-(3) of Theorem 5 are trivial to verify and, in fact, have been done 
already in Chapter 5. The zeros of y,(x) are located at the n points 


1 3 5 2n—1 
2n’2n’2n” 2n ’ 


and it is easily verified that the zeros of y,,, interlace those of y,. For example, 


the four zeros 4, 3, 3, and j of y,(x) interlace the three zeros }, 3, 2 of y3(x) since 


Pet 3 3 5 Li 
8 6 8 6 8 6 8 
Finally, part (5) of Theorem 5 is simply Fourier’s Theorem of Section 5.4. 
Let us return now to the Hermite, Legendre, Tchebycheff, and Laguerre 
differential equations 
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y’ —xy’+ Ay=0, —-0o<x< 0, (28) 

(1 — x2)y” —2xy’ FAA + Dy=0, —l<x<l, (29) 
(1 — x”)y” — xy’ + A?y =0, —-1<x<1l, (30) 
xy’+(l—xy’t4y=0, O<x<~, (31) 


respectively. Except for the Legendre equation (29), the coefficient of y’ in 
each of these equations is not the derivative of the coefficient of y’. We remedy 
this situation by multiplying through by an appropriate integrating factor 
(see Section 1.2). Then we can rewrite Equations (28)—(31) in the equivalent 
form 


L[y](x) = fo y'(x) = —Je~*’!? (x), —0o<x<oo, (28’) 
d 
LLy|(x) = — x*)y"(x) = —AAt+ Dy), —-1<x<t, (29) 
d 2)1/2,," —A? ! 
Llyl(%)= 7 0 — x*) YO) = Taya VO) —-1<x<l, (30) 
LE y](x) = £ xe*y'(x) = — he *y(x), 0<x< oo. (31’) 


Indeed, we can now begin to understand the presence of the factor r(x) in the 
Sturm-Liouville problem 


LL y](x) = —Ar(x)y(X). 


Our next step is to determine what boundary conditions should be im- 
posed on the solutions of Equations (28)—(31). To this end, observe that none 
of the Equations (28’)—(31’) are regular in the sense of the definition preceding 
Theorem 5. To wit, the functions 


p(x) =1—x* and p(x) =(1— x’)? 


for the Legendre and Tchebycheff equations vanish at the end points — 1 and 
+ 1 (which in a limiting sense is also true for the functions e~*”? and xe~, 
which occur in the Hermite and Laguerre equations), and the interval (a, B) 
for both the Hermite and Laguerre equations is not finite. 

Let us return now to our proof of Theorem 4 where we derived the relation 


(Lu, v) — (u, Lv) = — p(x)[u'(x)v(x) — u(x)v'(x)]5. (32) 


For the operators L appearing in Equations (28)—(31), the integrals appearing 
in the left-hand sides of Equation (32) may now have to be considered as 
improper integrals. In the case of the Legendre and Tchebycheff equations, 
the function p(x) vanishes at the end points. Thus, L will certainly be self- 
adjoint if we impose the boundary conditions that y’(x) [and hence y(x), see 
Exercise 12] is bounded as x > +1. In the case of the Hermite equation, we 
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require that 


lim e7*7/?[u(x)v'(x) — u’(x)v(x)] = 0. (33) 
x7 +0 

In the case of the Laguerre equation, p(x) vanishes at x = 0. Thus, one 
boundary condition is that y’(x) [and hence y(x)] is bounded as x > 0. The 
second boundary condition is determined from the requirement that 

lim xe *[u(x)v’(x) — u’(x)v(x)] = 0. (34) 
Finally, we note that the conditions (33) and (34) translate into the boundary 
conditions 

lim e7*7/?y(x)=0 and lim xe *y(x) =0 
x7 +00 x 2 

for the Hermite and Laguerre equations, respectively. 

It is interesting to note (but difficult to prove) that the boundary conditions 
imposed on the solutions of Equations (28)—(31) are exactly sufficient to force 
the power series solutions of these equations to terminate at a finite point. In 
other words, the only solutions of Equations (28)—(31) that satisfy our bound- 
ary conditions are the Hermite, Legendre, Tchebycheff, and Laguerre poly- 
nomials, respectively; or, in the language of linear algebra, the only nontrivial 
solutions of the equation 


LLy](x) = —Ar(x)y(x) 


for the operators L in Equations (28’)-(31’) and the boundary condi- 
tions discussed above are the eigenfunctions y,(x) with eigenvalues /,, and 
these eigenfunctions, when properly normalized, are the Hermite, Legendre, 
Tchebycheff, and Laguerre polynomials, respectively. 


Remark. We have imposed boundary conditions on Equations (28)—(31) to 
ensure that the respective operators L are selfadjoint. This will guarantee that 
parts (1) and (2) of Theorem 5 are true. More generally, parts (1) and (2) of 
Theorem 5 are true even for singular Sturm-Liouville problems, but parts 
(3)-(5) may sometimes fail to be true, even though they are true for the 
Hermite, Legendre, Tchebycheff, and Laguerre equations. 

We conclude this section by proving that the only nontrivial solutions of 
the Hermite equation 


y” —2xy’+Aay=0 or (e°*7/?y'! = —AeW*?y (35) 
satisfying the boundary conditions 
lim e7*7/?y(x) =0 (36) 
xX>+0 


are the Hermite polynomials H,(x), which are the polynomial solutions of the 
equation 


y” — 2xy’ + 2ny = 0. 
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In other words, all nonpolynomial solutions of Equation (35) fail to satisfy 
the boundary conditions (36). To this end we set 


ye) = Yay” 
Computing 
y' (x) = >, na,x"' and y'(x)= y n(n — 1)a,x"? 
and plugging into Equation (35) yields 
y (n + 2)(n + 1)dgy 2x" — y, 2na,x" + y ja,x" = 0. 


This, in turn, implies that 


_ (Qn-A) 
“m2 ~ n+ Dn + 1) ™ 


For the moment, let us set a, = 0 and ay = 1. Then Equation (37) forces all 
the odd coefficients a3, a;,..., to be zero. Next, observe that 


2 (n—2/2) 


(37) 


= —___"“q , 38 
“m2 42) (nt (8) 
For large n, 
(n— 2/2), 
(n+1) °° 
Thus, for n > N sufficiently large, 
2 
= ere © ) N. 
An+2 n+ 5 (9a, n> (39) 
Next, look at 
z(x)= > b,x" 
n=0 
where 
b+ = : (9)b, and b=1 (40) 
nt+2 ~~ n+ y) . n O~ *: 
Then, 


2 1 
b, = 569) = 7?) 
2? 1 
b, = 540 - a 9)" 
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93 


b. = 
© 2-4°6 


I 3 
(9)° = 3,(9) 


and 


2(x) = 1 +e OO a ON 6 go 
2 2\2 2\3 
(.9x*) (.9x*)*  (.9x*) poe 


aT 2 31 


2 
_— e2* 


It is clear from Equations (39) and (40) that for 2 4 2n (so that a,,. is never 0) 
y(x) > py(x) +e? 
for some appropriate polynomial py(x) of degree N. But then 


lim e7*/?y(x) = lim [e-*”?p,(x) + e*”] = o. 


x7 +0 x7>+0 


Hence, the only way y(x) can satisfy the boundary condition (36) is if A = 2n, 
so that its series terminates. 


The case ay = 0, a, = 1 is similar, and we leave it as an exercise for the 
reader (see Exercise 13). 
EXERCISES 


1. Show that the boundary terms in Equation (8) are trivially zero if either b, or b, 
iS Zero. 


2. Here is the outline of a proof that L [see Equation (3)] is not selfadjoint if 
b(x) # a’(x). Observe first that if b(x) # a’(x) and we can choose u and v so that 


v(x)u'(x) — u(x)v'(x) = b(x) — a’(x), 


then the integral in Equation (10) reduces to 


| ” [b(x) — a'(x) J? dx # 0. 


The only problem is that u and v have to satisfy the boundary conditions (3’). To 
modify the proof, 


(a) Observe that 


v(x)u'(x) — u(x)v"(x) = v2(x) (*) | 
(b) Choose u, v, and constants c,, C3,...,C,+; (with k to be determined) so that 


(“) =C, [b(x) — a’(x)] +C3 [b(x)—a’(x)]}? +C5 [ b(x) _ a'(x)]> oes, 
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u and v satisfy the boundary conditions (3’), and the integral in Equation (10) 
is £0. 


3. Show that Equation (13) is true even for u, v complex. 


4. Show that Liouville’s transformation 


Zz ; r(x) 
= Gearey = = Tae 
reduces Equation (11) to 

z” — f(t)z = —Az. 


What is f(t)? Note that under this transformation, the eigenvalues remain fixed 
and the weight function becomes unity. 


5. Let P,(x) be the Legendre polynomial of degree n. 
(a) Show that P/(x) satisfies a selfadjoint equation with A = n(n + 1) — 2. 


1 
(b) Show that | P'(x)P/(x)(1 — x2)dx =0,m nn. 
—1 
6. Find the eigenvalues and eigenfunctions of the boundary value problem 
x*y"=—Ay, — yl) = 0, y(2) = 0. 
(Hint: Try y = x?.) 


In each of Problems 7-9, find the eigenvalues and eigenfunctions of the given 
boundary value problem. 


7. yi +Ay=0; yO) =0, y(n) — y(n) = 0 
8 yi +Ay=0; — y(0) — y"(0) = 9, y(1) = 0 
9. yi tay=90; — y(0) — y'(0) = 9, v(x) — y'(x) = 0 
10. For which values of 4 does the boundary value problem 
yy —2y'+(1+Aa)y=0, — y(0) = 9, y(1) = 0, 
have a nontrivial solution? 
11. Show that the singular boundary value problem 
y+ Ay=90, — y(0) = y(2n), y'(0) = y'(2n), 
has a continuum of eigenvalues. 


12. Suppose that y’(0) is bounded as x + +1. Show that y(x) must also be bounded as 
x > +1. [Hint: Write y(x) as an integral involving y’(x).] 


13. Let ag = Oand a, = 1. Show that the power series solution )'*. a,x" of Equation 
(35), with a, determined from Equation (37), will not satisfy the boundary condi- 
tions (36) unless the series terminates, i.e., 1 = 2n for some integer n. 
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Some simple facts concerning functions 
of several variables 


1. A function f(x,y) is continuous at the point (X9,yo) if for every e >0 
there exists 6(€) such that 


If(xy)-fxorol<e if [x— xl tly —yol <8 (). 
2. The partial derivative of f(x,y) with respect to x is the ordinary de- 
rivative of f with respect to x, assuming that y is constant. In other words 


Of(Xo¥o) ..  F(Xot hyo) -F(X0¥0) 
= iQ 
Ox h—0 h 


3. (a) A function f(x,,...,x,) 18 said to be differentiable if 


of 
0x 


n 


0 
f(x, + 4Ax,,...,x, tAx,) —f(%1,...5%,) = pant ot Ax, +e 


where e/[|Ax,|+... + |Ax,|] approaches zero as |Ax,|+...+|Ax,| ap- 
proaches zero. (b) A function f(x,,...,x,) is differentiable in a region R if 
Of/0x,,...,0f/ 0x, are continuous in R. 

4. Let f=f(x),...,%,) and x=2,()1,---.Ym) J=l....n. If f and g are 
differentiable, then 


> em 
We 9 9X; Ws 


This is the chain rule of partial differentiation. 
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5. If all the partial derivatives of order two of f are continuous in a re- 
gion R, then 
af ae 


Ox,9x, 9x, 0X,’ 


J, k=1,...,n. 


6. The general term in the Taylor series expansion of f about x, = 
x0... x, = x2 is 


1 G/F FI (xD, 05 Xp) 


TT ;, 7, (x, — x)!" (x, — x0)” 
JiteSn Ox4!... 0x? 


n 
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Sequences and series 


I. A sequence of numbers a,, n=1,2,... is said to converge to the limit 
a if the numbers a, get closer and closer to a as n approaches infinity. 
More precisely, the sequence (a,,) converges to a if for every e >0 there ex- 
ists N(e) such that 


la—a,|<e ifn>N(e). 


2. Theorem 1. Jf a,—a and b,->b, then a,+b,>a+ b. 


PROOF. Let e>0 be given. Choose N,(e), N>(€) such that 
ja—a,|< 5 forn>N,(e), and lb— bl <5 for n> N,(e). 


Let N (e)=max{N,(e), N,(e)}. Then, for n> N(e), 
Ja, + b, —(a£b)| <|a,—al+|b,-b]<5 + 5 =e. oO 


3. Theorem 2. Suppose that a, , > a,, and there exists a number K such that 
|a,|< K, for all n. Then, the sequence (a,) has a limit. 


Proor. Exactly the same as the proof of Lemma 1, Section 4.8. oO 


4. The infinite series a, +a,+...= > a, is said to converge if the 


sequence of partial sums 
Sj=Q), Sg=a,;t+a,, ..., S,=a,ta,t...ta,,... 


has a limit. 
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5. The sum and difference of two convergent series are also convergent. 
This follows immediately from Theorem 1. 


6. Theorem 3. Let a,>0. The series >) a, converges if there exists a number 
K such that a,+...+a,<K for all n. 


ProoF. The sequence of partial sums s,=a,+...+a, satisfies s,,, 25 


n° 


Since s, < K, we conclude from Theorem 2 that >) a, converges. 0 


7. Theorem 4. The series >\ a, converges if there exists a number K such 
that |a,|+ ...+|a,|<K for all n. 


Proor. From Theorem 3, >) |a,| converges. Let b, =a, +|a,|. Clearly, 


0<b,<2\a,|. Thus, >) b, also converges. This immediately implies that the 
series 


2 4n= 2 [b,—l4nl] 


also converges. 0 


8. Theorem 5 (Cauchy ratio test). Suppose that the sequence |a,,,/a,| has 
a limit \. Then, the series >, a, converges if N<1 and diverges if X>1. 


Proor. Suppose that \< 1. Then, there exists o<1, and an index N, such 
that |a,,,|<pla,| for n> N. This implies that |a,,,|<p?|a,|. Hence, 
N+K |ay| 


> |a,|<(U+p+...+p*)lay| <— 
n=N l—p 


bd 


and >) a, converges. 
If A>1, then |ay,,|>p”|ay|, with p > 1. Thus |a,|—>00 as noo. But |a,| 
must approach zero if >) a, converges to s, since 
lan +11=[5n417 Sal < ISn41—5|+|5,— 5] 
and both these quantities approach zero as n approaches infinity. Thus, 
>) a, diverges. O 
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Appendix C 


C Programs 
C Program 1 


+ include <stdio.h> 
+ include <math.h>» 


int main( ) 


{ 
double X [200] = {1}; 
int k, N= 15; 
printf (‘ N%10sX[N]\n’,‘ ’’); 
for (k=0;kK<=Njk+ +) 


printf(‘“%4d%4s%17.91f\n’’,k, “ ’’, X[k]); 
X[k + 1] =0.25 + sin(X[k]); 


j 
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C Program 2 


+ include <stdio.h> 


int main( ) 

{ 
double X[200] = {1.4}; 
int k, N=5; 


printt (“ N%10sX[N]\n’’, “* ’’); 
for (k=0;kK<=N;k++) 


printf(“%4d%4s%17.91f\n’’, k, “* ’’, X[k]); 
X[k + 1] = X[k]/2 + 1/X[k]; 


} 


C Program 3 


+ include <stdio.h) 
+ include <math.h)> 


int main( ) 

{ 
double a, d, V[200] = {45.7}; 
double c=0.08, g=32.2, W=527.436, B= 470.327; 
int k, N= 15; 


a=(W—B)/c; 

d=300+c*g/W; 

printf (‘‘ N%10sX[N]\n’’, ‘* ’’); 
for (k=0;kK<=N;k++) 


printf(“%4d%4s%17.91f\n'',k, °°’, V[k]); 


V[k+ 1] =V[k] + ((a— V[k])/V[k]) 
«(V(k] +d+a+*log(1—(V[k]/a))); 
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C Program 4 


+ include <stdio.h> 
+ include <math.h> 


int main( ) 

{ 
double T[1000] = {0}, Y[1000] = {0.5}, a=1, h; 
int k, N= 10; 


h=alN: 
for (k=0; k<N; k++) 


T[{k+ 1] = T[k] +h; 
Y[k+ 1] =Y[k] +h (1+ pow(Y [k] — T[k], 2.0)); 


printf(‘‘%4s%16s\n’’, eT “y"): 
for (k=0;k<=N;k++) 
printf (% 10.71f%2s%16.91f\n’’, T[k], ‘* ’’, Y[k]); 


C Program 5 


+ include <stdio.h> 
+ include <math.h> 


int main( ) 

{ 
double T[1000] = {0}, Y[1000] = {0.5}, a=1, h; 
int k, N= 10; 


h=a/N; 
for (k=0;k<N; k++) 


T[k+1]=T[k]-+h; 
Y(k+ 1] =Y[k] +h « (1+ pow(Y[k] —T[k], 2.0)) 
+h«h*pow(Y[k] —T[k], 3.0); 


printf(‘“%4s%16s\n”, “TT, my"); 


for (k=0;k<=N;k+-+) 
printf (‘‘%10.71f%2s%16.91f\n”’, T[k], “ ”’, Y[k]); 
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C Program 6 


+ include <stdio.h> 
+ include <math.h> 


int main( ) 


double T[1000] = {0}, Y[1000] = {0.5}, a=1, h, R: 
int k, N= 10: 


h=a/N; 
for (k=0;k<N; k++) 
{ 
R=1+ pow(Y [k] — T[k], 2.0); 
T[k + 1] =T[k] +h: 
Y[k+ 1] =Y[k] + (h/2) *(R+1 
+ pow(Y [k] +h*R—T[k+ 1], 2.0)); 


printf(‘‘%4s%16s\n’’, ‘T’’, “Y’’); 
for (k=0;k<=N;k++) 
printf (%10.71f%2s%16.91f\n’’, T[k], “* ’’, Y{k]); 


C Program 7 


+ include <stdio.h> 
+ include <math.h> 


int main( ) 


double T[1000] = {0}, Y[1000] = {0.5}, a=1, h; 
double LK1, LK2, LK3, LK4: 
int k, N= 10: 


h=a/N; 
for (k=0;k<N; k++) 
{ 

T[k+ 1] =T[k] +h; 
LK1=1+ pow(Y [k] — T [k], 2.0); 
LK2= 1+ pow((Y[k] + (h/2) *LK1) — (T[k] +h/2), 2.0): 
LK3 = 1+ pow((Y[k] + (h/2) « LK2) — (T[k] +h/2), 2.0); 
LK4=1+pow((Y[k] +h*LK3) —(T[k] +h), 2.0); 
Y[k + 1] =Y[k] + (h/6) * (LK1+LK4+42«(LK2+LK3)): 


printf(“"%4s%16s \n’’, ‘‘T’’, “Y’’); 


for (k=0;k<=N;k++) 
printf (‘‘% 10.71f%28%16.91f \n’’, T[k], ‘© ’’, ¥[k]); 
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C Program 8 


+ include <stdio.h> 
+ include <math.h> 


int main( ) 


{ 


double T[1000] = {0}, Y[1000] = {0}, a=1, h: 
int k, N= 10; 


h=a/N; 
for (k=0;k<N;k++) 


T[k+ 1] =T[k] +h; 
Y[(k+ 1] = Y[k] +h « (Y[k] « (1 + exp(— Y[k])) + exp(T [k])); 


printf(‘‘%4s%10s%20s\n’’, “‘N’’, ‘‘h’’, “Y{[N]’’); 
printf (“%4d%2s%10.71f%20.101f\n'’, NN,“ ”’, h, Y[N]); 


C Program 9 


+ include <stdio.h> 
+¢ include <math.h» 


int main( ) 


{ 


double T[1000] = {0}, Y [1000] = {0}, a=1,h; 
double DY1, DY2; 
int k, N= 10; 


h=a/N; 
for (k=0;kK<N;k++) 


{ 


T[k-+ 14] =T[k] +h: 

DY1=1+(1—Y[k]) *exp(—Y[k]): 

DY2=Y[k] « (1+ exp(—Y[k])) + exp(TIk]); 

Y[k+1]=Y[k] +h*DY2+ (h«h/2) « (exp(T[k]) + DY1 * DY2): 


printf (“%4s%10s%20s\n", “N’, “h’, “Y[N]"); 
printf (““%4d%28% 10.71f%20.101f\n"’, N, ““ ”, h, Y[N]); 


Appendix 
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C Program 10 


+ include <stdio.h> 
+ include <math.h> 


int main( ) 


double T[1000] = {0}, Y [1000] = {0}, a=1,h; 
double R1, R2; 
int k, N= 10; 


h=al/N; 
for (k=0;k<N;k++) 


T(k+ 1] =T[k] +h; 

R1=Y[k] « (1 + exp(— Y[k])) + exp(T[k]); 
R2=(Y[k] +h*R1) «(1+ exp(—(Y[k] +h*R1))) +exp(T[k+ 1]); 
Y[k+ 1] =Y[k] + (h/2) « (R1+ Ra); 


printf (‘“%4s%10s%20s\n’’, ““N’’, “h’, “Y[N]’’); 
printf (‘%4d%2s% 10.71f %28%18.101f\n’’,N,“ ",h,* ’, Y[N]); 


j 


C Program 11 


+ include <stdio.h> 
+ include <math.h» 


int main( ) 


double T[1000] = {0}, Y[1000] = {0}, a=1, h; 
double LK1, LK2, LK3, LK4; 
int k, N= 10: 


h=a/N; 
for (k=0;k<N; k++) 
{ 
T[k +1] =T[k] +h; 
LK1=Y[k] « (1+ exp(— Y[k])) + exp(T[k]); 
LK2=(Y[k] + (h/2) « LK1) «(1+ exp(—(Y[k] + (h/2) « LK1))) 
+ exp(T[k] + (h/2)); 
LK3=(Y[k] + (h/2) « LK2) « (1+ exp(—(Y[k] + (h/2) « LK2))) 
+ exp(T[k] + (h/2)); 
LK4=(Y[k] +h*LK3) «(1+ exp(—(Y[k] +h*LK3))) 
+ exp(T[k+ 1]); 
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Y(k+ 1] = Y[k] + (h/6) « (LK1+2*LK2+42+*LK3+4LK4); 


printf (“%4s%10s%20s\n’’, “*N’’, “h’’, “Y[N]’’); 
printf (““%4d%2s%10.71f%2s%18.101f\n’,N,‘ ’, hh,“ ’’, Y[N]); 


} 


C Program 12 


+ include <stdio.h> 
+ include <math.h» 


int main( ) 

{ 
double T[1000] = {0}, Y[1000] = {1}, a=1, h; 
double LK1, LK2, LK3, LK4; 
int k, N= 10; 


h=a/N; 
for (k=0;k<N; k++) 
{ 

T[k + 1] =T[k] +h; 
LK1=T[k] * T[k] + Y[k] * Y[k]; 
LK2=pow(T[k] +h/2, 2.0) + pow(Y[k] + (h/2) * LK1, 2.0); 
LK3 = pow(T[k] + h/2, 2.0) + pow(Y[k] + (h/2) * LK2, 2.0); 
LK4=pow(T[k] +h, 2.0) + pow(Y[k] +h « LK3, 2.0); 
Y(k+ 1] =Y[k] + (h/6) * (LK1+2*LK2+2+*LK3+4LK4); 


printf(““%4s%15s\n’’, “T’, “Y”); 
for (k=0;k<=N;k++4+) 
printf('%10.71f%2s%16.91f\n’’, T[k], “* ’’, Y[k]); 


C Program 13 
# include <stdio.h> 
+ include <math.h>» 
+ define PI 3.141592654 


int main( ) 


( 
double T[1000], Y[1000], h; 
int k, N=25; 


h = 2.0/(double)N; 
TH]=h; 
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Y¥(1] =0; 
for (k=1;k<N;k++4+) 


T[k+ 1] =T[k] +h; 

if (Y[k] ==0) Y{k+ 1] =h* T[k] *sin(PI/T[k]); else 

Y(k+ 1] =Y[k] +h « (Y[k] « pow(fabs(Y [k]), — 0.75) 
+ T[k] « sin(PI/T[k])); 


printf(‘‘%4s%15s\n’’, eT my"); 
for (k=1;kK<=N;k++) 
printf(‘‘%10.71f %2s%16.91f\n”, T{k], “ ’’, Y[k]); 


C Program 14 


+ include <stdio.h> 
+: include <math.h> 


int main( ) 

{ 
double A[200] = {1, 1}, T=0.5, sum; 
int k, N=20; 


A[2] = —0.5*« (A[1]+A[0]); 
sum=A[0]+A[1]*T+A[2] «T+ T; 
for (k=1;kK<=N—2;k++) 


A[k+ 2] = ((k+ 1) * (k—1) * A[k + 1] —A[k] + A[k— 1]) 
/((kK+ 1) *(K+2)); 
sum=sum-+ A[kK-+ 2] * pow(T, (double)(k + 2)); 


} 


printf(‘“For N= %d and T= %6.41f\n’’,N, T); 
printf(‘‘the sum is: %11.91f\n’’, sum); 


} 


556 


Answers to 
odd-numbered 
exercises 


Chapter | 


SECTION 1.2 


1. y()=ce7s™ 3. y(H= 7 5. y(t) =exp(— SPY [expat + ¢] 


c+ ft ey )'/4ae 


7. y(t) = —______—__ >, y(nmexp(— f'VIFe e~*ds) 


(i+2)'/7(1414)'/4 
3e" —1 
2 


L. y()= 13. y(t)= eaew [ioe << 3 
2r 


y= (5 +E arte)lt ey 1/2 1. y(9= | 0<t<l 


2(e—l1)e~‘, t>1 
21. Each solution approaches a distinct limit as t-0. 


23. All solutions approach zero as t->7/2. 


SECTION 1.3 


3. 127,328 8. (a) N2s9(t)= No39(0)27 19°-°1/45; (b) Na35(t) = Nz35(0)2~ 10-74/-707 
7. About 13,550 B.c. 


SECTION 1.4 

1. y()= oa 3. y(t)=tan(t—417 +c) 5. y(t) =arcsin(c sin?) 
12 

7. yO= 9+2In( 24° , ~O<t<aw 
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Answers to odd-numbered exercises 


9 y()=1-[44214+274+ 2]'/2, -2<1<00 


_ arkt —] eo 
11. y= T+akr’ Gk <t<o, ifa=b 
ab[1—e*O- 94) 1 in , 
Y()= — onan” Kaa MH SIS oH 
ct? (7-1 
13. (b) y()= —t and y()= ina 18. y(O= > 


17. y=c’e2V'"/, t>0; y= —c2e2Vi/y t<O0 19. t+ye'”=c 
21. (b) |(b+c)(at + by) + an+ bm| = keOt Xa-) 
23. (t+2y)°—(t+2y)=c—-Tt 


SECTION 1.5 


3. a>0.4685 
_ —0.0014 
7. (a) ? =0,003p — 0.001p?—0.002; (b) p(t) = 12229:998—999,998e __ 


999,999 — 999,998¢— 0.001" * 
P(t)-2 as t-00 


SECTION 1.6 


cNt 
LO emt 


SECTION 1.7 


1. VQ@O= at — ef 8/W)) 


\/399 l 
Kye 440 - 9)/V322 + 4 3 2 (i- J) +1 
1. V(nev3m | 8 
Kye 4t- 5)/V322 ~] - ( 1 

Ve 


— 22 
9. (a) VV ~- VV, + in BN VY = Set, (b) V;= 8 
c mg—cV Vo m 
gR° 


do ; _ 
11. om ~ Gary (b) Vo= V2gR 


SECTION 1.8 
3, a0? yrs. 5. 10%" 7, (@) c(t) =1 —e-9"*; (6) 1000In > min. 


9. c(N=1(1—e 8) + £0 4-000 11.48% 13. xy=c 


15. x?+y2=cy 117. y*(ny?— 1) =2(c— x’) 
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Answers to odd-numbered exercises 


SECTION 1.9 


3. *siny + ye’ =c 5S. ytant+secr+y?=c 7. y(t) = 172 


— 
XS y)=-?P+V4-(P-1) Il. ry + 1(74,7) = 10 
13 2: y(n=+ fet 09 
e Q==- ; = ——— . = 1° = 1 _— 
y (1+ en) a=1; y(t)=arcsin[(c—f)e"] 
SECTION 1.10 
14 76 2n 
l.y,(Q=C+ a tat. +o 
20 
3. y(t)=e'— 1; yft)=t—e! + 
107 ,¢ ,° 8 /, (itne™ oof 
y,(O= ae tatat ys t2d—de +—>z _ — 3 + 6 
2 
19. y()=sin 5 


SECTION 1.11 
3. (a)O<x9<10 5. 1.48982 7. (c) 0.73909 


Section 1.11.1 
5. 0.61547 7. 0.22105 9, 1.2237 


SECTION 1.12 
1. y,=(-7)"-gl(-7)"—-1] 3. (a) FE, < 33"— J); (b) £, < 2Q"-)) 


5. ¥, = a1.--Ay,—] 7 


poe 


n—1 b; | 
yit da 7. Yas 55 
= a). 


9, (a) x = $251.75; (b) x =$289.50 


SECTION 1.13 

1. A=0.1, yio=1l; A=0.025, yag=1 

3. h=0.1, yig=1;  h=0.025, yyo=1 

5, h=0.1, yi9= 1.80516901; A=0.025, yo = 1.94633036 


Section 1.13.1 
4(107°) 


h(1+e+e?) 
—— < -— 
3(e7-1) 


lL E,.< AH e/5—1) 3. E, < [ee/er+D_—]] 


SECTION 1.14 


1 A=O0.1, yip=1; A=0.025, yyg=1 3. A=O., vig=l; h=0.025, vag=1 
5. h=0.1, yip=2; A=0.025, yan =2 
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SECTION 1.15 


1. h=0.1, yig= 1; h=0.025, y4g= 1 3. h=0.1, yig=1; h=0.025, yao= 1 
5. h=0.1, yi9= 1.98324929; h=0.025, y4q= 1.99884368 


SECTION 1.16 


1. h=0.1, yigo=1; h=0.025, yog=1 3. A=O.L, yip=1;  A=0.025, yag=1 
5. h=0.1, yio= 1.99997769; h=0.025, yao = 1.9999999 


SECTION 1.17 
1. 2.4103 3. 0.0506 5. 0.4388 


Chapter 2 


SECTION 2.1 


1. (a) (4-—30e'; (b) 3V3 tsin V3 t; (c) 2(4—3e'+ 12V3 tsin V3 2; 
(d) 2—32?; ©) 5(2 — 327); (f) 0; (g) 2—32? 


5. (b) W= 5; (A) »()=2Vi 


7. (a) —(bsinatsin bt + acosatcosbt); (b) 0; (c) (b— aje@*; (d) e7; (e) ¢; 
(f) — be 


SECTION 2.2 
1. y(D=ce'+c,e7' 3. y(Q=e*/| ce’ 12-4 eye” V9 1/2] 


y(o= (et +4e~‘) 7. y(H)= V3 1/2 3% t/10 _ e~ 351/10) 


9V>-3 IL yM=ctt+e,/P 


Section 2.2.1 


+ c,sin 


1. yjme "|e cos V3 4 


V3t 
| 


2 
3. y(t)=e7'(c,cos V2 t+c,sin V2 t) 
cos 3 in| 


V7 


9. sone — a cin VC) | 


yee”? 


11. y,()=coswt, y(t) =sin wt 
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Answers to odd-numbered exercises 


15. Vi ae" 1+i -=| V2 +1 +iVV2 -1 ] 
Vai Wi =+[VVE-1 +iVV2 41 |, (Vi =e") 
19, y= —— e,cos YE Inst e,sin Ini 


Section 2.2.2 
Le y(Qe(cytegie = 3. y(NS(It3Ne"? 7. y(t) =2(t— me)? 
I. y(t) = (ce, t+ ete” 13. p(t) = ct + 002 -— 1) 15. y(t) = c(t + 1) + coe” 
c;+c,Int 
17. y()=c,(1 +30) + c,e* 19. y()= —_— 
SECTION 2.3 


l. y(Q=c,toer%+r 3. y(th=e" +2e'-2e7" 


SECTION 2.4 

1. y(t)=(c,+Incost)cost+(c) + #)sint 

3. y(Q=cye'/* +[c,+ 91-207 + Pye! 

§. y(Q=4(2cost—3sint)e~'—e7'+ He 


Int 
3 


c 
7. y= [vse fer "9— elds -Q, year+ = + 


HM. y(Q=Vt ferteanes f'Fv5 coss[In¢—Ins]ds 
0 


SECTION 2.5 


3_ 9p 
Lys At 


7. Y(t)= fet[sin2¢—21tcos2z] 9. Y(t) = 14+ t(cos2t—4sin 272) 


3. WN=14-—Gtt+iPye' 5. y= Se 


_-) . t 1) te! tot 
11. W()= Fy (coss+7sing+(5 aie 13. V(t)=H(e e ) 
15. Y(t) = — i cos3r+isrsine« 
_ l , _ 15 
17. (b) y(t) = 0.005 + 32,000.018 (1°57 301 20,000 cos 307) + 7 Sin 10r 
SECTION 2.6 


1. Amplitude = 4, period = 47, frequency = 8 
7. a> B, where [1+(1—B) ]e~7#=10-° 
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Answers to odd-numbered exercises 


9. y(t)= eo (5 + wt D 


e-*) cos(t~m)-+ fe "sin 
11. 7/2 seconds 

Section 2.6.2 

3. O()= 05 | 1- (coss0ov3 + Fr sin 500V3 Jes) 


_ 3 
Steady state charge= 750,000 


SECTION 2.8 
(- 1"! 


1. y(t)=age~"/? +4, > 3-5..(an +1) 


317. 34 328 3-328 = 3-3-5 7!0 
1+ or + 5a 26.3! 2841 210.5) 


5. y(t)= Ss (n+1\(t- 1)?” 


n=0 


Cod 


- V(t)=a +... 


er 
ta(r+ 5} 


a '6 
5. 6+%3-6-1011 + 3-610-11-15-16 7°"! 
x2 = M4—-A)tA -A(4—-AY(B—A)E 
9. (a) (N= 1- B-  - t 


(b) y,(4) = 42-5 +(2-2)6-A)-E + (2-AY6—NN(10-)-G +... 


7. y(t)= —2[t+ = 


11. (a) y\()=1—a of _ gh? a2) 
— a?(2? ~ a?)\(4? — a) + 
yxH=t+(1?- a?) +(1?—«?)(3?- a?) + 

13. (a) yO=1-4 7-1 + r+... (b) y(4)20.8592436 
15. (a) y\(N=1—40°-th + 424+... (b) y(4)~0.86087198 
17. (a) y()=3 451-477 + B— 304+... (b) y(4)~5.3409005 
Section 2.8.1 

l. y(th=cyt+e,/t? 

3. y(t) =¢,(t—1) +. ¢,(t-1)? 

5. y(t) = t(c; +c, In £) 

7. y(t) = c,cos(In t)+ cy sin(In rt) 
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Answers to odd-numbered exercises 


9. y(t) = —— ad 


2V3 =| 3 


SECTION 2.8.2 
1. Yes 3. Yes 5. No 


cy t? ? 14 7 
1. y= (1-1-5 -s- oe | 
2 22 2 
+ eat (+++ teen t | 


9. y(t) = 14204 F) tet” ee 
WOE 3} 2 27.21-5  23.31-5-7 

3r 3°77 337° | 

Ii. yinae(1+ 75 3737-21373) 


3 377? 3373 
1/4 a 
Feat 3 5° 5-9-2! 5-9-1331 * 


4 
— 5/2 pio, 


y(n=r'/?? an 
2 2 2-4 2-4-6-8 
_ op —p4l,U, Po 
IS. y,(t) = e' /2, y(t) OTT tsa ts t 


17. »,()=——L y(t) =F") 


t 
19. (e) wigaPe "fs dt 


4 
21. (6) (0) =¢(2- ee 
(c) yalt) = yl) f = 
Ko" 
23. (c) y(t) = a(t) f Fn 


At ,0aveF 1. 
ay? ay (2!)° 

4 © AU-9) (mM =I-d) , 
can 


25, (b) y(t) =1-——~ 


no, 
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4r° 
27. (b) y,(t)=1+2¢4+ 27 Atty. 
St, 17t? 89r° 94174 
='/? lt +> + + —__ + 
an (60y(21y * 6)(21)60) 


SECTION 2.8.3 


1. y(t) =14+4t+407+--- 
176; 


y(t) =y(int—|8¢ +12 +O P+ - | 


a 
_§ § "(nt 
n=1 2°"n'(n—1)! 


3. (b) I(t) =41 D> 


(c) (= — F(oine +2 


SECTION 2.9 
] s—a 1{ 1 S 
1 — 3. ———.—_ 5, ~| —+ 
s? (s—a) +b? a1 5 Tea | 
1 eh a—b 7 
2 s?+(at+by s?+(a—by S 
2 
15, y()=2e'—te—1e% 17, ¥(sy= STOS*O 
(s+1) 
2s7+5+2 
19. Y(s)= ——J———_ 23. y(t)= — te! +4e%—1e +1 
©) (s?+3s+7)(s?+1) “ ° 
SECTION 2.10 
mM 3, 245 A3 , /a 
° gttl ° (s?+ a2) ° 85° S 
aT . Ss — , _ 5 -t42,-3! 
7. (a) 5 arctans; (b) In ae ; (c) In~ 9, —3+2e-'+4e 
11. cosh V2! 4 3 sinh VST 1 e3t/2 3. 13-10 —1 
V57 2 


15. i(cost—te~‘) 19. y(t)=cost+$sint—trcost 21. y(th=tre! 


V3 1 


cos —~-- — 


- 7 
23. y(t)=1+e7'+ —_. sj 
a4 V3 


nV3t Jer" 
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Answers to odd-numbered exercises 


SECTION 2.11 
1. y(t)=(24+ 3te7'+2H,(t)[ (t-S)+(t- Dew" J 
3. y(t) =3cos2¢—sin 21+ 4(1 —cos2t)— 5(1 —cos2(t—4)) H,(0) 


5. y(t)=3cost—sint+4fsint+ +H, /21)| (¢- $7) sint —cos¢ | 


7. noma Oe 1)+ (cos ~ sin GT New 
— (t-2) 
— H,(t)(7t—1)— H3(t)| 13cos V27 a 


V27 (t-2 
+V27 sin) jew] 


9. y(t)=te' + H(t)[2+1+(2t—-S)e'~!]— Hh [1+1+(2t-7e"?] 


SECTION 2.12 

3. (a) y(t) =(cosh $¢—3 sinh $/)e*/* —2H,(1)sinh $(t —2)e*~ 9? 
5. y(2) = d(sint —tcost)— H,(t)sint 

7. y(t)=3te'+307e~'+3H3()(t—3)e"¢~-) 


SECTION 2.13 


—e@ asinat— bsinbt 5 sinat — at cosat 
b-a ; a2 — b? ; 2a 
9. 1¢sint =. (¢-2)4+(t4+2)e7' = 13. y(N=t+2sinde «15. y(N= 40? 
17. y()=ssint+(1—4ne' 


7. t—sint 


SECTION 2.14 


lL. x(th=cje*+3ce", y(t)=c,e*+2c,e% 
1 2 I 2 


~20, y(t) = [c, cos t + cz sin f]e~*" 


3. x(f)=[(c, + cz) sin t + (c, — cz) cos te 
§. x(N=le%+ie', y(th=te%—te! 
7. x(t)=coste', y(t)=(2cost+sinte' 
9. x(t)=2(tcos¢+3rsint+sin‘e’, y(t)=—2tsint e’ 
11. x(t)= —4sint—itsint—tcost+5costln(sect + tan), 
y(t) = —tsint—tcost—+sin?tcost—Ssintcost+5sin7t 


— i sin? ¢ + 5(cos¢—sin 1) In(sec¢ + tan £) 


SECTION 2.15 


lL. y(D=cye'+oe-'+c3e%" = - 3. p(D=(eytegttcstye*+c4e-' 5. y(t) =0 
7. y()= —3-2t-174+(3-—De! 9. ¥()=1-—cost—Incost—sintln(sect+ tans) 
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Answers to odd-numbered exercises 


~~! Tens _ 
11. VO= =e J [ sin(t s)/V2 cosh(t—s)/V2 
—cos(t—s)/V2 sinh(t—s)/V2 |g(s)ds 
13, Y()=4t(e-*—1)—tsint 15. ¥(=1| (4-40) cost+(3 + b¢- 4 07)sine| 
17. P(O=t-—1+ hte‘ 


Chapter 3 
SECTION 3.1 
Lij=x. 3 4=x%, 7. x=(_? a x3)=(?) 
X72 = X3 X72 = X3 
x3= _ x3 X4=Xq 


Xg=1-x; 


0 0 -!1 2 1 2 —] 
9, (i 0 0) «-0=(3) 11. | 3} | 0) | ‘ 
0 -!1 0 4 — | —] 2 
11 —7 5 ~—] 17 
13. (a) | 16 J; (b) 2}1;(c)110};(@)]1 9 15. A-( 2 ‘} 
7 —3 1 1 —2 4 


SECTION 3.2 
1. Yes 3. No 5. No 7. Yes 9. No 11. Yes 


SECTION 3.3 
1. Linearly dependent 3. Linearly independent 
5. (a) Linearly dependent; (b) Linearly dependent 
7. db) y(=e, yN=e"! 9 p(N=t-1, p)=P-6 
—] 
11. °-(=1] 17. x,;=y, 


l 


] 
X= ——()2- 
2 ya y3) 


1 
3 yy? y3) 


SECTION 3.4 


L xine cos V3 1/2 e~'/2 
“*O= (—4cos V3 ¢/2—4V3 sin V3 ¢/2)e7'/? 


sin V3 ¢/2 e~'/? | 


2 — 
*W) ios t/2-14sin V3 t/2)e~'/? 
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Answers to odd-numbered exercises 


3. x'=( 0 ), o=(¢ | 5. Yes 7. Yes 9. No _ 11. (b) No 
le 


e! 


SECTION 3.5 
3. —48 5. 0 7. —97 11. No solutions 
x} l x —] 
*2}_1]0 X2/_ | -2 
13. x,/~10 15 x, Cc ; 
X4 0 x4 4 
SECTION 3.6 
1 2 25 1S $52 0 
1. AB=/| 1 7 10],BA=/] 5 -1 1 
10 6 12 10 «#18 #6 
1 1 2 1 2 1 
3. AB={1 2 1/],BA=/2 1 1 
2 1 #1 I 1 2 


I. a5 18 —6 18 ll. 5 1-2i -1l —I1+i 


6 14 —18 1—2i 1 1+i 
13. 5 —] ] l-i 
] —-] -1-ji 
SECTION 3.7 
b, 2b, + 3b, ] —4q 
1. b= b, 3. b= b» 5. 4 -i)+ 2 
b, +2b, b, l 0 
l 
7. No solutions 9. x=! 0 11. A=1,-1 
0 
13. (a)A=—1; (b)x=— —] |+cl —7 
0 6 
SECTION 3.8 


1 3 ] 0 2 
1. x(Q=ey( 5 )e*+02( 3 Jem 3. x(t)=c,| O]e~*+c| 2 Je~'+c;] 1 Je® 
—] 


—] 2 
0 l 
co{ 1 ]}+c31 0 | je” 
0 2 


2 
5. xm oe 
—] 
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Answers to odd-numbered exercises 


I 4({ -7 4 (1 
13, (b) x(4)=9{ O Jes '+3)  2Je~@" P+ Zl Jere 
1 3\ 13 3\y 


SECTION 3.9 


_ 2 2sint 21 
I. x()= je | Lint) cost reine) | 


t 2 0 0 _ COs ft 
3. x(t)=e"| c)[ —2 ] +e. cos 2t + C3 sin 2t 5. x(t) = Icost+sint }° 


3 sin2t —cos2t 


cos V2 t— V2 sin V2 t 


2 —V2 sinV2 t—V2 cosV2 t 
x(t) | 2 
—3cos V2 t 


SECTION 3.10 


" oma r 14 a(t )ro() 
] 
—T 

3. xomal0 . ~ ty alo J+ 1 e! 

0 
1—t 
5. xo=(0 Je 7. x(t)= ( f Je 
] t 


Te'— Se+ 8e-! — eft Se%t-— fe! 
13. (b) a 2le'— Se*—Il6e7' -—3e'+ S5e%+ 8e-! 
14e'+10e%’—24e-' —2e'—10e%"+12e7! 


A (at 
15, 1+ = (e*—1) 


SECTION 3.11 


e244 5e2t — et 5e2! —5e3! e~ 2! — et 
1. 5 —e~% 4 e3t 5e3t —e 24 03! 
4e~74 Se 4 e3 = — 5 e244 5¢3! fe — 24 4 93 
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Xj 
9. x0-( 0 
X3 


3e'— Se*—Je™! 
9e'— Se**—de-! 
6e' + 10e7' —6e~' 


Answers to odd-numbered exercises 


, 6e2'—cost—2sint —2e2'+2cost+4sint 2e7'—2cost+sint 
3. 5 | 3e?"—3cost—sint —e*+6cost+2sint —cost+3sint 
Ssint —10sint S5cost 
(t+1l)e~’ (ttle '-e*% e'-e 
5. —te~! — te !'+e e~7#— en! 
te‘ te~' e! 
3.1 -1 , (16 -25 30 
7A=11 3 —-!1 9, A= 73 8 —6 —24 11. No 
3 3 -!1 0 13 26 


SECTION 3.12 


l x(1)=2e'( 100s! + 3/sin/-+ sins) 

; —tsint 

— 4sint — ttsint — tcost+Scostln(sec¢ + tan?) 

— tsint — tcost+sintcos?t— 4 sin*tcost 

3. x(t)= 
— 5sintcost+5sin?t—+sin?¢ 
+ 5(cos¢— sin?) In(sec¢ + tan Z) 


303! —2e2! — te! 324 1p4l 
2 1 
—ir—tt} 


5. x(t) = e2! 


3e7! —2e! 


2 1 
13. o-( -2)é 17. Y()= (- Je 
—] —] 
SECTION 3.13 
1. x(j=e-"(3) +e%( 3) 


3. aiy=—$ (era H(2)ea4 (0) (1) 43( Le! 
5. x(t)= “(1 3p ) 


11. Ho-| 


7 x()=2e'( fost + 3/sin/ + sin#) 


—tsint 
(mgs) tera 
9. x(t)= ; 1. x()=| ¢ Je” 
cos Zt t 
( inde isae) fo 
t—1?—iP-—it+te 1 
13. x(t)= ~1)2 15, x(t)= re e3! 
r+ie 1+21 
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Answers to odd-numbered exercises 


Chapter 4 

SECTION 4.1 

1. x=0,y=0; x=0,y=1; x=1,y=0; x=3,y=4 
Nya rat: val ve _ [ea 

3. x=0, y=0, z=0; r= Ga Gra-(S4S 


5. x=0, y=Yo; | Yo arb. 7. x=0, y= —2,2=n0 
X=Xp, V=1;° Xo arb. 


X=Xq Y=—1; Xo arb. 
SECTION 4.2 
1. Stable 3. Unstable 5. Asymptotically stable 
7. Asymptotically stable 9. Stable 


11. x(t)=1 is stable; x(¢)=0 is unstable. 


SECTION 4.3 


1. x=0, y=0 is unstable; x=1, y=0 is unstable; 
x=—1, y=0 is unstable; x=0, y=2!/* is stable; 
x=0, y= —2!/4 is stable. 


3. x=0, y=1 is unstable; x=0, y=—1 is unstable; 
x=1, y=0 is unstable; x=—1, y=0O1s stable. 


5. x =0, y=nm is unstable for all integers n. 
7. Unstable 9. Unstable 11. Stable 13. Stable 
15. Unstable 17. Unstable 


SECTION 4.4 
1. y=cos(x—1)? 3. y=tanx 5, x?+y?=c? 


7. The points x= Xo, y=yo with x9 +yo= —1, and the circles x?+y?=c?, minus 
these points. 


9, The points x=0, y=yp;_ x= Xo, y=0, and the curves y = ce ~ 2/9", x >0; 
y=ce 2/3)" x <0, 


11. The points x =0, y=yo, and the curves 


(ad— bc) 
(by — dx) — —q Inle- =k, x>0 


(ad— bc) 
(by — dx) — —z— Inle- a =k, x<0 


13. The point x=0, y=0; and the curves xy?—1x°=c. 

Section 4.5.1 

3. = - 5 nb +09)= x +k §. z -== J In(b + dy) % In(atex)+k 
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Answers to odd-numbered exercises 


SECTION 4.7 
13. (b) y2= x7 4+ x44 ce? 


SECTION 4.8 ° 


5. The circle x7+y*=1. Note that every point on this circle is an equilibrium 
point. 


7. The circles x? +y?=(2n+1)7/2 


SECTION 4.9 
l. e=—-1,1 
3. e=—2,2 


5. No bifurcation points 


SECTION 4.10 


1. x=0, y=0 is unstable; x =(a/e), y=0 is stable if ad< ec, and unstable if 
ad > ec; x =(af+bc)/(ef+ bd), y =(ad—ec)/(bd+ ef) exists only for ad > ec 
and is stable. 

3. x, =0, x.=0, y=0; x, =c/d, x,»=nc/[d(a,+a))], y =na,— a} — aja); 
assuming that na, > a?+ aa. 

. l+a, l—a, 

5. (b) @) y1= T+a,a;’”? 


_ . oe _ —/f)- —! 
l—a,a,’ (11) y| 1, y2 0; (c) a= 3 


SECTION 4.12 


3. (a) rI+AI?7/2=ylnS—rS+c; (b) Yes 
5. (a) 0.24705; (b) 0.356; (c) 0.45212; (d) 0.60025; (e) 0.74305; (f) 0.77661 


SECTION 4.13 


7. Ei Byc— a 
. Either x(t)->0 or *()>— (for B,c—a,>0) 
Chapter 5 
SECTION 5.1 

(2n+1)r _ (Qn+1)7x 
1. A, = 4p YO) = sina — 

_ pra 

3. A=0,y=c; A= = ; y(x)=ccos “7 


3. y(x)=csinh V —Ag x, where sinh V —Ay t= V —Ay cosh V —Ag 7; 
y(x)=csin VA, x, where tan VA, T= VA, ; 


7, A=—1, y(x)=ce*; A=n?, y(x)=c[ncosnx+sinnx] 
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Answers to odd-numbered exercises 


SECTION 5.3 


1. u(x, t)=sin baxe~'7 "4/4 4.3 sin Saxe ~ 1-7 25071/4 


(1 —40r2)¢ _ Tsin 4ax ee! - 162)t 


. u(x, t)=3sin27xe 
» u(t, y) = elt) + e304) 
. u(t,y)=e"e—Y +2e- Me — 14el3iely 


. (a) X”-pX=0; Y”-(pt+dA)Y=0; F’—do?T=0; 
nTrx niy oan? ab? — a2)t /a?b? 
b 


w uN oO & 


(b) u(x, y,H= sin sin —— 


SECTION 5.4 
4 sinvx , sin3a7x , sinSrx 
Ll. f(x)= oe ae + OO | 
2[sin7vx sin2ax , sin3ax 
3. f(xy= 5 | SA — SO Sm | 
3 121 nx nv nx 
5. Gs n [sin COS ~~ +(1 cos") sin” | 
e’(—1)” ~ n1x nTX 
7. fia ® f+} Soe Os ~ nesin “| 
00 I(e'(-1)"-1) 
e! nix 
SOT 4? Bae ge OT 
2 (-1)"(e —e~‘) NTX nTxX 
11. I(x )= he SE [tos tm sin "a | 
2 co (—1) cosnx 
13. f(x)= 4 sinx —tsin3x 17. a) f(y=% +4 2 — 
SECTION 5.5 
1. & 21-cosna/e) 
2 tna? COS NIrx 
_3a, & 4a(cosna/2 —1) n7rx 
3 f= G+ Lage 8 
(oo) 
5 fi=a+4 > 4 [2c0snm/2 —1—(~1)"]cos* 
Tv n=] n 


7. f(x)= = >» (cos nn /2 — —(-1)" ) sin * 9. f(x)=sin2x 


2, 4[ cos2x , cos4x 
11. (a) sare? ff 132 + a toa 0<x<; 


4/2sin2x . 4sin4x 
b) cosx = — + 
(0) 7| 27—] 4*—] 


ton 0<x<l; (c)No 
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Answers to odd-numbered exercises 


SECTION 5.6 
= 280 3 sin(2n + I)ax/10 — 0.86(2n + 1)22721/100. 
1, (a) u(x, = 7 a Int] © ’ 
0 2n(1—(—1)"cos 10) 
b) u(x,)= in 2X ¢— 0.86n? 174/100, 
©) wes 2 n*2? — 100 0 


(6) u(x,4)= 100 3 | “SMO — 


n=] 


_ 0 ((—1)"—cos37/10 
(4) u(x,)= =? > (pi 2084/0) 


n=] 


5. (b) u(x,t) = 10x +800 x sinnn /2 2 sin 27 el -n? 


«ATX | —0.86n? 2t/100. 
in ——~ e 
nv 100 , 


~— AMX — 2 2, 
in 0 e 0.86n? 171/100 


vo) na? 10 
SECTION 5.7 
4 2n+1 l x ct 
lL. u(x, )=- ————_ - sin(2n + 1)=cos(2n + 1) — 
7 (2n+1)’-4 2n+1 2 2 


co ’ 2 
3. u(x,)= 12 S M2MA/S i, MX Qo, nmcl 


n=l n2 3 3 

= 8 Sl nm ig MEX ogg itt 

5. u(x, ?) ee 2 sin” . sin 7 COST5 
9. u(x,y,D=X(x)¥(y)T(0); Y(y)=a,cos V A?— p* y+ bo sin 


X (x)= a,cospx+b,sinux  T(t)=a;cosAct+ b;sinAct 


SECTION 5.8 
_S&  . am(x-a) | ny _ —2 b . nny 
1. uey= Qe ¢, sinh ——>—— sin a Co= SaahnaaTb J, SSN 
_— Snax. nay). nx 
3. Mey)= Qe Cn SIN sinh ——~—-; C, = canhenbya J J (x)sin— dx 
20 sin(2n—1)=* sinh(2n— 1) 
S. u(x,)=-— 
u(x,y) 2 (2n—1)sinh(2n—1)ab/a 
sinh(2n— 1) 7° sin(2n — 1) - 


BSN 


3 


4495 
‘72 


1 Gan amb on ea 
sinh(2n — 1)7~—4 sin(2n— = 


7 uCoy)=147 2 —GaoisinhQn-aa/b 


9 wxy)= 22 $ (—1)"sinh V1+n?7? xsinnmy 
T n=! nsinh V 1+ na 


1. X”+AX=0; Y"+pY=0; Z”-A+p)Z=0 
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Answers to odd-numbered exercises 


SECTION 6.3 

/3 /5 5/7(_, 12 
1. po(x) = “4” P(x) = “g p2(x) = i (. — 2) 
SECTION 6.4 


7. y(x) = csinh,/ —A x, where sinh,/ —Agz = ./ —A,cosh./ —AoT; 
y(x) = csin,/2,,x, where tan,//,2 = ./A, 


9, A= —1, y(x) = ce*; A = n’, y(x) = c[ncosnx + sinnx] 


574 


Adjoint matrix 315 
Analytic functions 189 
Asymptotic stability 381 
Atomic waste problem 46 
Autonomous systems 376 


Bernoulli equation 67 
Bessel equation 137 
orderv 218 
order zero 220 
Bessel function 
J,(t) 218 
Jo(t) 220 
Boundary conditions for 
elastic string 483 
heat equation 482 
Laplace equation 483 
second order o.d.e. 476, 533 
Boundary-value problem 476, 483, 533 
heat equation 483 
Laplace equation 508 
wave equation 503 


Carbon dating 19 
Cauchy ratio test 189 
Cayley—Hamilton theorem 351 
Characteristic equation 138 
complex roots 141 
real and equal roots 145 
real and unequal roots 138 
Characteristic polynomial 334 
complex roots 341 
real distinct roots 334 
repeated roots 345 


Index 


Combat models 405 
Complex exponentials 141 
Compound interest 57 
Convolution integral 252 


Damping force 166 
Detecting art forgeries 11 
Detection of diabetes 178 
Difference equations 91 
Differential operators 129 
Diffusion equation 481 
Dirac, P. A.M. 244 
Dirac delta function 244 
Laplace transform of 246 
Direction field 397 
Dirichlet problem 483 
Discontinuous forcing function 238 


Eigenfunctions 478 
Eigenvalues of 
boundary-value problem 478, 533 
matrix 333 
multiplicity of 347 
Eigenvectors of a matrix 333 
linear independence of 335 
Elastic membrane 
vibration of 482 
Elastic string 482, 503 
boundary conditions for 483 
fundamental frequencies of 505 
initial conditions for 483 
nonzero initial displacement 505 
propagation of initial conditions 506 
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Index 


Electrical circuits 175 
Epidemic models 38 
gonorrhea 491 
plagues 39 
threshold theorem 460 
Equilibrium points 373 
Error 
effect of step size 102 
for Euler’s method 100 
formula 101 
for improved Euler’s method 110 
round-off 105 
for Runge—Kutta method 113 
for three-term Taylor series 107 
Escape velocity 52 
Euler equation 145, 149, 198 
Euler method 97 
formula error 101 
Even functions 493 
Exact equations 58, 60 
Existence and uniqueness theorems for 
first-order equations 67 
orbits 314 
second-order equations 129 
systems of first-order equations 291, 414 
Extension of a function 
even 495 
odd 496 


Finding roots by iteration 81 
First-order differential equations 1, 2 
exact 58, 60 
existence and uniqueness theorem for 67 
general solutions of 4, 21 
homogeneous 3 
initial-value problem _ 5, 7, 21 
integrating factor for 8, 64 
linear 3 
nonlinear 3 
numerical solutions of, see Numerical 
methods 
separable 21 
several variables 264 
systems of 265 
Fourier series 487, 488 
convergence of 487, 488 
cosine series 494 
Fourier coefficients 489 
Parseval identity 492 
sine series 494 
Frobenius method 203 
Fundamental matrix 355 
Fundamental set of solutions 133 


General solutions 
first-order linear equations 4, 21 
higher-order equations 259 
second-order equations 132, 142, 148 
systems of first-order equations 334 
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Generalized functions 249 
Gompertzian relation 53 
Gonorrhea model 491 
Green’s theorem 437 


Heat equation 481, 499 
bar with insulated ends 500 
boundary conditions 483 
initial conditions 483 
nonhomogeneous boundary conditions 
503 
smoothing of discontinuities in initial 
conditions 499 
Hermite equation 197,515, 539 
Hermite polynomial 197, 541 
Higher-order equations 259 
general solution 259 
variation of parameters 262 
Homogeneous equations 25 
Hypergeometric equations 218 


Identity matrix 311 
Impedance 177 
Improved Euler method 109 
Indicial equation 209 
Initial conditions 482 
for elastic string 483 
for heat equation 483 
propagation of initial conditions for wave 
equation 506 
smoothing of discontinuities for heat 
equation 499 
Initial-value problem _ 5, 7, 21, 128, 259 
Inner product 516 
Integral equation 70 
Integrating factor 8, 64 
Inverse matrix 316 


Jump discontinuities 227, 238 
Judicious guessing of particular solutions 
157 


Kirchoff’s second law 176 


Laguerre equation 218, 539 
Laplace equation 482, 508 
boundary conditions 483 
Dirichlet problem 483 
Neumann problem 483 
Laplace transform 225 
of the convolution 252 
definition 226 
of derivatives 229 
of the Dirac delta function 246 
existence of 227 


inverse of 230 
of systems 368 
Legendre equation 197, 539 
Legendre polynomials 197 
Limit cycles 436 
Linear algebra 271, 296 
Linear independence 136, 281 
of eigenvectors 335 
of vector functions 293 
of vectors 281 
Linear operators 131 
Linear differential equations, see First-order 
differential equations, Second-order 
differential equations, or Systems 
of first-order differential equations 
Linear transformations 324 
Logistic curves 30, 43 
Logistic law of population growth 28 


Malthusian law of population growth 27 
Matrices 267 

addition of 276 

adjoint 315 

characteristic polynomial 334 
determinant of 299 
diagonal 301 

lower 301 

upper 301 

eigenvalues of 333 
eigenvectors of 333 
fundamental 355 

Hermitian 530 

identity 311 

inverse 316 

product of 310 

selfadjoint 530 
Mechanical vibrations 165 
damped forced vibrations 169 
damped free vibrations 167 
forced free vibrations 170 
free vibrations 166 
resonant frequency 173 
Method of elimination 257 
Mixing problems 53 


Natural frequency 171 
Neumann problem 483 
Newtonian mechanics 46, 127, 166 
Newton’s method 88 
Nonlinear differential equations 

autonomous system 376 
Numerical methods 95 

effect of step size 102 

error 100, 102, 107, 110, 113 

Euler 97 

formula error 101 

improved Euler 110 

Runge-Kutta 113 

three-term Taylor series 107 


Index 


Odd functions 493 

Ohm’s law 176 

Orbits 394, 397 

Order of a differential equation 1 
Ordinary differential equations 
definition 1 


Parseval’s identity 492 
Partial differential equations 
definition 481 
Particular solution 151, 364 
Periodic solutions 416 
Phase plane 394 
Phase portrait 418, 419 
Picard iterates 70 
Piecewise continuous functions 227 
Poincaré—Bendixson theorem 433 
Population models 27 
competitive exclusion 451 
Logistic law of population growth 28 
Malthusian law of population growth 27 
sharks 443, 444 
Potential equation 482 
Power series 188 


Qualitative theory 372 


Radioactive dating 12 

Reduction of order 147 

Reduction to systems of equations 265 
Regular singular points 204 
Resonance 170 

Resonant frequency 173 

Reynolds number 50 

Runge—Kutta method 113 


Schwartz, Laurent 245, 248 
Second-order differential equations 127, 138 
characteristic equation 138 
complex roots 141 
real and equal roots 145 
real and unequal roots 138 
Euler equation 145, 150, 198 
existence and uniqueness theorem 129 
fundamental set of solutions 133 
general solutions 132, 142, 148 
homogeneous equation 128 
judicious guessing of particular solution 
157 
nonhomogeneous equation 151 
discontinuous function 238 
particular solution 151 
reduction of order 147 
series solution 185, 188 
singular points 198, 204 
variation of parameters 155 


577 


Index 


Separable equations 20 
Separation of variables 483, 484 
for heat equation 484, 500 
for Laplace equation 508, 510 
for wave equation 503 
Sequence of eigenvalues 478 
Series solution 185, 188 
recurrence formula 187 
when roots of the indicial equation are 
equal 212, 216 
when roots of the indicial equation differ 
by aninteger 212, 216 
Singular points 198, 204 
Solutions, definition 1 
first-order equations 4, 21, 60 
higher-order equations 259 
second-order equations 132, 142, 148 
systems of first-order equations 334 
Spread of technological innovations 39 
Spring-mass dashpot system 165 
damped vibrations 167, 169 
forced vibrations 170 
free vibrations 166 
Stability 
asymptotic 381 
of equilibrium points 385, 386 
linear systems 378 
ofasolution 373, 378 
Sturm—Liouville boundary value problem 
533 
Systems of algebraic equations 297 
Systems of linear first-order equations 265 
characteristic polynomial 334 
complex roots 341 
real distinct roots 334 
repeated roots 345 
definition 265 


578 


existence and uniqueness 291 
fundamental matrix 355 
general solution of 334 


nonhomogeneous 
reduction to 265 
stability of 378 


360 


variation of parameters 


Tacoma Bridge disaster 


Taylor series 189 


360 


173 


Tchebycheff equation 197, 539 
Tchebycheff polynomials 


Theory of war 398 
Thermal diffusivity 


481 


Three-term Taylor series 


Trajectories 394 
Tumor growth 52 


Variation of parameters 
for systems of equations 360 


Vectors 266 


197, 541 


106 


155 


linear independence of 281 
solutions of systems of equations 


valued functions 
Vector spaces 273 

basis 287 

dimension of 283 
Volterra, Vito 444 


Wave equation 482 
initial conditions 


solution 504 
Wronskian 133 


266 


, 903 
boundary conditions 


483 


483 


297 


Texts in Applied Mathematics 


(continued from page ii) 


31. Brémaud: Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues. 

32. Durran: Numerical Methods for Wave Equations in Geophysical Fluid Dynamics. 

33. Thomas: Numerical Partial Differential Equations: Conservation Laws and Elliptic 
Equations. 

34. Chicone: Ordinary Differential Equations with Applications. 

35. Kevorkian: Partial Differential Equations: Analytical Solution Techniques, 2nd ed. 

36. Dullerud/Paganini: A Course in Robust Control Theory: A Convex Approach. 

37. Quarteroni/Sacco/Saleri: Numerical Mathematics. 

38. Gallier: Geometric Methods and Applications: For Computer Science and 
Engineering. 

39. Atkinson/Han: Theoretical Numerical Analysis: A Functional Analysis Framework. 

40. Brauer/Castillo-Chdvez: Mathematical Models in Population Biology and 
Epidemiology. 

41. Davies: Integral Transforms and Their Applications, 3rd ed. 

42. Deuflhard/Bornemann: Scientific Computing with Ordinary Differential Equations. 

43. Deuflhard/Hohmann: Numerical Analysis in Modern Scientific Computing: 
An Introduction, 2nd ed. 

44. Knabner/Angermann: Numerical Methods for Elliptic and Parabolic Partial 
Differential Equations. 

45. Larsson/Thomée: Partial Differential Equations with Numerical Methods. 

46. Pedregal: Introduction to Optimization. 

47. Ockendon/Ockendon: Waves and Compressible Flow. 

48. Hinrichsen: Mathematical Systems Theory I. 

49. Bullo/Lewis: Geometric Control of Mechanical Systems: Modeling, Analysis, and Design for 
Simple Mechanical Control Systems. 


